Web of Hypertext (RDFa, Microformats) and Web of Data. Semantic Web Lecture. © Copyright 2009 STI INNSBRUCK, www.sti-innsbruck.at


© Copyright 2009 STI INNSBRUCK Web of Hypertext (RDFa, Microformats) and Web of Data Semantic Web Lecture (WS 09/10) Dieter Fensel and Tobias Bürger

Agenda
Introduction: From Hypertext to the Web of Data
“Building” the Web of Data
–Embedding Structured Information in Web Pages
  Standards: Microformats, RDFa, GRDDL
  Example: Yahoo SearchMonkey “Feeding the Monkey with RDFa and Microformats”
–Linked Data
  Principles
  Publishing and Consuming Linked Data
  Adding Legacy Data to the Web of Data
  Examples
Current Developments
Wrap Up

Evolution of the Web
Figure: stages labelled Hypertext, Hypermedia, Web, Web of Data, Social Web (Web 2.0), Semantic Web, followed by “?”. Pictures from [1] (“As We May Think”) and [2].

Motivation: From a Web of Documents to a Web of Data
Web of Documents: “Documents” connected by hyperlinks.
Web of Data: “Things” connected by typed links.

Vision of the Web of Data
Many common things are represented in multiple data sets.
Linking identifiers links these data sets.
The Web of Data is envisioned as a global database
–consisting of objects and their descriptions
–in which objects are linked with each other
–with a high degree of object structure
–with explicit semantics for links and content
–which is designed for humans and machines
Content on this slide by Chris Bizer, Tom Heath and Tim Berners-Lee

How to “Build” the Web of Data?
Publish structured data by
–using Web (2.0) APIs [4]
–embedding structured information (Microformats [5], RDFa [6], GRDDL [7])
–linking data [3]

Microformats Recommended literature: [5], [8]

What are Microformats? “Designed for humans first and machines second, microformats are a set of simple, open data formats built upon existing and widely adopted standards. Instead of throwing away what works today, microformats intend to solve simpler problems first by adapting to current behaviours and usage patterns (e.g. XHTML, blogging).” [5]

What are Microformats? /2
Are highly correlated with semantic (X)HTML / “Real world semantics” / “Lowercase Semantic Web” [9].
A way to combine human-readable with machine-readable information.
Provide means to embed structured data in HTML pages.
Build upon existing standards.
Solve a single, specific problem (e.g. representation of geographical information, calendaring information, etc.).
Provide an “API” for your website.
Build on existing (X)HTML and reuse existing elements.
Work in current browsers.
Follow the DRY principle (“Don’t Repeat Yourself”).
Compatible with the idea of the Web as a single information space.

Microformats Illustrated Content on this slide by Chris Griego

Design Patterns
Elemental microformats
–rel-home links to the homepage
–rel-license links to the content license (e.g. CC BY 2.0)
–Others: rel-tag, rel-enclosure, XFN tags
Compound microformats
–E.g. hCard, hCalendar, hEvent, hReview
Picture from [5]

Example Compound Microformat: hCard
hCard is a simple format for representing people, companies, organizations, and places, using a 1:1 representation of the properties and values of the vCard standard (RFC 2426).
    BEGIN: VCARD
    VERSION: 3
    FN: Dieter Fensel
    ORG: STI Innsbruck
    …
    URL: http://
    TEL:
    END: VCARD
Example on this slide by Alexander Graf

Example Compound Microformat: hCard /2
hCard is a simple format for representing people, companies, organizations, and places, using a 1:1 representation of the properties and values of the vCard standard (RFC 2426).
Visible text of the hCard-annotated page:
    Dieter Fensel
    STI Innsbruck
    mail me
    Phone:
Example on this slide by Alexander Graf
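To make the idea of hCard as an “API” for a page more concrete, here is a minimal sketch in Python that reads a few hCard properties out of HTML using only the standard library. The markup, the e-mail address and the phone number are placeholders invented for illustration; only the class names (vcard, fn, org, email, tel) come from the hCard microformat. The parser covers just these four properties and assumes well-nested markup without void elements, so it is not a complete hCard parser.

```python
from html.parser import HTMLParser

# Hypothetical hCard markup; all values are placeholders.
HCARD_HTML = """
<div class="vcard">
  <span class="fn">Dieter Fensel</span>
  <span class="org">STI Innsbruck</span>
  <a class="email" href="mailto:dieter@example.org">mail me</a>
  Phone: <span class="tel">+00 000 0000</span>
</div>
"""


class HCardParser(HTMLParser):
    """Collects a small, fixed subset of hCard properties."""

    TEXT_PROPERTIES = {"fn", "org", "tel"}

    def __init__(self):
        super().__init__()
        self.stack = []  # one entry per open element: property name or None
        self.card = {}

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        classes = (attributes.get("class") or "").split()
        prop = None
        if "email" in classes and attributes.get("href"):
            # For links, the value is taken from href, not from the link text.
            href = attributes["href"]
            prefix = "mailto:"
            self.card["email"] = href[len(prefix):] if href.startswith(prefix) else href
        else:
            prop = next((c for c in classes if c in self.TEXT_PROPERTIES), None)
        self.stack.append(prop)

    def handle_endtag(self, tag):
        if self.stack:
            self.stack.pop()

    def handle_data(self, data):
        text = data.strip()
        # Text belongs to the innermost enclosing element that names a property.
        current = next((p for p in reversed(self.stack) if p), None)
        if text and current:
            self.card[current] = (self.card.get(current, "") + " " + text).strip()


def parse_hcard(html):
    parser = HCardParser()
    parser.feed(html)
    return parser.card


if __name__ == "__main__":
    print(parse_hcard(HCARD_HTML))
```

The text “mail me” and “Phone:” stay visible to human readers but are not part of the extracted data, which is the human-first, machine-second split the microformats definition describes.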

Drawbacks of Microformats
Only a fixed set of microformats exists.
No way to connect data elements.
Fixed vocabulary that is neither extensible nor customizable.
Separate parsing rules are needed for each microformat.

RDFa “RDFa is microformats done right” (Bob DuCharme) Recommended literature: [6], [10]

RDFa
Goals: Bringing the Web of Documents and the Web of Data closer together, for both human and machine consumption.
Follows the DRY (“Don’t Repeat Yourself”) principle.
RDFa is a serialization syntax for embedding an RDF graph into XHTML.
RDFa is domain-independent. In contrast to the domain-dedicated microformats, RDFa can be used for custom data and multiple schemas.
Benefits inherited from RDF: independence, modularity, evolvability, and reusability.
Easy to transform RDFa into RDF data.
Tools for RDFa publishing and consumption exist [14].
Earlier comparable approaches: SHOE [11], HTML-A [12], and eRDF [13].

RDFa – Usage Example
RDFa uses (X)HTML attributes to express metadata within (X)HTML (used attributes: see [10]).
Example: Embedding FOAF into HTML using RDFa
Visible page text: Dieter Fensel. Tobias Bürger. Tobias Bürger knows Dieter.
Embedded RDF in Turtle notation (prefix declarations not shown):
    :dieter a foaf:Person;
        foaf:name "Dieter Fensel".
    :tobias a foaf:Person;
        foaf:name "Tobias Bürger";
        foaf:knows :dieter.
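As an illustration of how RDFa attributes carry triples, the following Python sketch extracts statements from a small XHTML fragment. The fragment is hypothetical markup written for this example, not the markup from the slide, and the extractor handles only a deliberately simplified subset of RDFa (about, typeof, property, and rel together with resource). Real RDFa processing has more rules (prefix resolution, chaining, the content attribute), so treat this as a sketch and not as a conforming processor.

```python
from html.parser import HTMLParser

# Hypothetical XHTML+RDFa fragment describing two people with FOAF.
RDFA_HTML = """
<div xmlns:foaf="http://xmlns.com/foaf/0.1/">
  <span about="#dieter" typeof="foaf:Person" property="foaf:name">Dieter Fensel</span>
  <div about="#tobias" typeof="foaf:Person">
    <span property="foaf:name">Tobias Bürger</span>
    <span rel="foaf:knows" resource="#dieter">knows Dieter</span>
  </div>
</div>
"""


class TinyRDFaParser(HTMLParser):
    """Extracts triples for a simplified subset of RDFa attributes."""

    def __init__(self):
        super().__init__()
        self.triples = []
        # Base frame: subject "" stands for the document itself.
        self.stack = [{"subject": "", "property": None, "text": []}]

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        # @about sets a new subject; otherwise the parent's subject is inherited.
        subject = a.get("about", self.stack[-1]["subject"])
        if "typeof" in a:
            self.triples.append((subject, "rdf:type", a["typeof"]))
        if "rel" in a and "resource" in a:
            self.triples.append((subject, a["rel"], a["resource"]))
        self.stack.append({"subject": subject, "property": a.get("property"), "text": []})

    def handle_data(self, data):
        # Text is attached to the innermost open element only.
        self.stack[-1]["text"].append(data)

    def handle_endtag(self, tag):
        if len(self.stack) > 1:
            frame = self.stack.pop()
            if frame["property"]:
                literal = "".join(frame["text"]).strip()
                self.triples.append((frame["subject"], frame["property"], literal))


def extract_triples(html):
    parser = TinyRDFaParser()
    parser.feed(html)
    return parser.triples


if __name__ == "__main__":
    for triple in extract_triples(RDFA_HTML):
        print(triple)
```

The same elements serve the human reader (the visible text) and the machine (the attributes), which is how RDFa avoids repeating the data in a separate file.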

GRDDL (“Gleaning Resource Descriptions from Dialects of Languages”) Recommended literature: [15], [16], [17]

What is GRDDL?
GRDDL is a technique for obtaining RDF data from XML documents (a GRDDL transformation).
It is a means to associate transformations (preferably expressed in XSLT) with an individual document.
GRDDL is applied in 3 steps:
(1) Declaration of a document as the source.
(2) Link to one or more extractors.
(3) GRDDL agent extracts RDF from the document.
Figure from Dominique Hazaël-Massieux.
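The first two steps can be sketched in Python for the XHTML case. In XHTML, a document declares itself a GRDDL source by listing the profile http://www.w3.org/2003/g/data-view on its head element, and it links to extractors with rel="transformation". The sample document and the stylesheet URL below are hypothetical. The third step, running the XSLT, is left out because the Python standard library has no XSLT processor.

```python
import xml.etree.ElementTree as ET

GRDDL_PROFILE = "http://www.w3.org/2003/g/data-view"
XHTML_NS = "{http://www.w3.org/1999/xhtml}"

# Hypothetical XHTML source document; the stylesheet URL is a placeholder.
SAMPLE_XHTML = """<html xmlns="http://www.w3.org/1999/xhtml">
  <head profile="http://www.w3.org/2003/g/data-view">
    <title>Schedule</title>
    <link rel="transformation" href="http://example.org/hcal2rdf.xsl"/>
  </head>
  <body><p>Calendar entries go here.</p></body>
</html>"""


def find_grddl_transformations(xhtml_text):
    """Return the transformation URLs a GRDDL agent would apply to this page."""
    root = ET.fromstring(xhtml_text)
    head = root.find(XHTML_NS + "head")
    if head is None:
        return []
    # Step 1: the document must declare itself as a GRDDL source.
    profiles = (head.get("profile") or "").split()
    if GRDDL_PROFILE not in profiles:
        return []
    # Step 2: collect the linked extractors.
    return [
        link.get("href")
        for link in head.iter(XHTML_NS + "link")
        if "transformation" in (link.get("rel") or "").split() and link.get("href")
    ]


if __name__ == "__main__":
    print(find_grddl_transformations(SAMPLE_XHTML))
```

A GRDDL agent would then fetch each returned stylesheet and apply it to the document to obtain RDF, for example with an external XSLT library.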

Use Case Scheduling: Jane is Coordinating a Meeting Robin publishes his schedule on his home page using the hCalendar microformat. David publishes his in Embedded RDF using some RDF calendar properties. Kate uses a blog engine that encodes her diary as RDFa. Jane uses an online calendaring service that publishes an RSS 1.0 feed of her schedule. Example from [17]

SearchMonkey: Making use of RDFa and Microformats in Search Recommended literature: [18], [19], [20] Slides about SearchMonkey by E. Goar and P. Tarjan (Yahoo)

What is SearchMonkey?
An open platform for using structured data to build more useful and relevant search results.
Figure: a search result, before and after.

Enhanced Search Result
Figure labels: image, links, key/value pairs or abstract.

How does it Work?
1. Site owners/publishers share structured data with Yahoo!
2. Site owners and third-party developers build SearchMonkey apps.
3. Consumers customize their search experience with Enhanced Results or Infobars.
Diagram labels: Acme.com’s Site, Acme.com’s DB, Index, RDF/Microformat Markup, DataRSS feed, Web Services, Page Extraction.

Feeding the Monkey: Data Sources (e.g., RDFa and Microformats)
Name          | Cached | Open | Mode    | Notes
Yahoo! Index  | yes    |      | Passive | Old-School Y! Index data
RDFa, eRDF    | yes    |      | Passive | Vocab + markup decoupled
Microformats  | yes    |      | Passive | Vocab + markup coupled
DataRSS feed  | yes    | no   | Active  | Atom + metadata
XSLT          | no     |      | Active  | Good for prototyping
Web Service   | no     |      | Active  | Brings in remote data

Linked Data Recommended literature: [3], [21-25]

Linked Data vs. Semantic Web
“In contrast to the full-fledged Semantic Web vision, linked data is mainly about publishing structured data in RDF using URIs rather than focusing on the ontological level or inference. This simplification - just as the Web simplified the established academic approaches of Hypertext systems - lowers the entry barrier for data providers, hence fosters a widespread adoption.” [23]

Linked Data Principles
1. Use URIs as names for things.
2. Use HTTP URIs so that people can look up those names.
3. When someone looks up a URI, provide useful RDF information.
4. Include RDF statements that link to other URIs so that they can discover related things.
Linked Data is about the use of Semantic Web technologies to publish structured data on the Web and set links between data sources.
Figure from C. Bizer
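Principles 2 and 3 amount to an ordinary HTTP lookup in which the client asks for RDF. A minimal client-side sketch in Python follows; the URI in the usage note is a placeholder, and the choice of application/rdf+xml as the requested media type is an assumption (a server may offer other RDF serializations).

```python
import urllib.request

RDF_MEDIA_TYPE = "application/rdf+xml"  # assumed serialization; others are possible


def build_rdf_request(uri):
    """Prepare a GET request that asks the server for an RDF description of `uri`."""
    return urllib.request.Request(uri, headers={"Accept": RDF_MEDIA_TYPE})


def fetch_rdf(uri, timeout=10):
    """Look up `uri` and return (content type, body bytes). Redirects are followed."""
    with urllib.request.urlopen(build_rdf_request(uri), timeout=timeout) as response:
        return response.headers.get("Content-Type"), response.read()
```

Calling fetch_rdf with the URI of a thing, for instance a hypothetical http://example.org/id/innsbruck, returns whatever description the publisher serves. Parsing the returned RDF and following the links it contains (principle 4) is where a client discovers related things.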

Linking Open Data Project
What? Community project with W3C support.
“The goal of the W3C SWEO Linking Open Data community project is to extend the Web with a data commons by publishing various open data sets as RDF on the Web and by setting RDF links between data items from different data sources.” [24]
Follows the Linked Data principles.
Basic idea: take existing (open) data sets and make them available on the Web in RDF. Once published in RDF, interlink them with other data sets.
Example RDF links: owl:sameAs.

LOD Cloud May 2007 Figure from [2]

LOD Cloud September 2008 Figure from [2]

LOD Cloud March 2009 Figure from [2]

Linked Data Publishing in 7 Steps
1. Select vocabularies.
2. Partition the RDF graph into “data pages”.
3. Assign a URI to each data page.
4. Create HTML variants of each data page.
5. Assign a URI to each entity.
6. Add page metadata and link sugar.
7. Add a Semantic Sitemap.
Slide by R. Cyganiak (see also [22])
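Steps 3 to 5 imply three kinds of URIs: one for the entity, one for its RDF data page and one for the HTML variant. One common way to connect them is a 303 redirect chosen by the Accept header of the request. The Python sketch below assumes a hypothetical URI layout (/resource/, /data/, /page/) and ignores quality values in the Accept header, so it only illustrates the idea and is not the method prescribed on the slide.

```python
RDF_TYPES = ("application/rdf+xml", "text/turtle")


def redirect_for(entity_path, accept_header):
    """Map an entity URI path to the data page or HTML page that describes it.

    Returns (status code, redirect target). The /data/ and /page/ prefixes
    are assumptions made for this example.
    """
    name = entity_path.rsplit("/", 1)[-1]
    wants_rdf = any(media_type in accept_header for media_type in RDF_TYPES)
    target = "/data/" + name if wants_rdf else "/page/" + name
    return 303, target


if __name__ == "__main__":
    print(redirect_for("/resource/Innsbruck", "application/rdf+xml"))
    print(redirect_for("/resource/Innsbruck", "text/html"))
```

Keeping the entity URI separate from both page URIs lets statements about the thing and statements about the documents describing it stay distinct.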

Linking Popular predicates for linking: e.g., owl:sameAs, foaf:homepage, foaf:topic, foaf:based_near, foaf:maker/foaf:made, foaf:depiction, foaf:page, foaf:primaryTopic, rdfs:seeAlso Possible linking for Wiskii.com (example distillery) Content on this slide by T. Heath, M. Hausenblas, C. Bizer, R. Cyganiak, O. Hartig
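To illustrate what owl:sameAs links make possible, the Python sketch below groups identifiers from different data sets that are declared to denote the same thing. The triples and the prefixed names (dsA:, dsB:, dsC:) are invented for illustration, and the grouping uses a small union-find structure, which is one possible implementation choice.

```python
SAME_AS = "owl:sameAs"

# Hypothetical triples from three data sets.
TRIPLES = [
    ("dsA:Innsbruck", SAME_AS, "dsB:innsbruck"),
    ("dsB:innsbruck", SAME_AS, "dsC:place42"),
    ("dsA:Innsbruck", "foaf:homepage", "http://example.org/innsbruck"),
]


def same_as_clusters(triples):
    """Group identifiers connected by owl:sameAs links (treated as symmetric and transitive)."""
    parent = {}

    def find(node):
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]  # path halving
            node = parent[node]
        return node

    for subject, predicate, obj in triples:
        if predicate == SAME_AS:
            parent[find(subject)] = find(obj)

    clusters = {}
    for node in list(parent):
        clusters.setdefault(find(node), set()).add(node)
    return [members for members in clusters.values() if len(members) > 1]


if __name__ == "__main__":
    print(same_as_clusters(TRIPLES))
```

Once identifiers are grouped this way, descriptions published under any of them can be merged, which is how linked identifiers connect separate data sets.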

Tools and Applications
Linked Data browsers: Tabulator Browser (MIT, USA), Marbles (FU Berlin, DE), OpenLink RDF Browser (OpenLink, UK), Zitgist RDF Browser (Zitgist, USA), Disco Hyperdata Browser (FU Berlin, DE), Fenfire (DERI, Ireland)
Linked Data mashups: Revyu.com (KMI, UK), DBtune Slashfacet (Queen Mary, UK), DBPedia Mobile (FU Berlin, DE), Semantic Web Pipes (DERI, Ireland)
Search engines: Falcons (IWS, China), Sindice (DERI, Ireland), MicroSearch (Yahoo, Spain), Watson (Open University, UK), SWSE (DERI, Ireland), Swoogle (UMBC, USA)
Listing on this slide by T. Heath, M. Hausenblas, C. Bizer, R. Cyganiak, O. Hartig

Adding Legacy Data to the Web of Data
Bringing relational data to the Web of Data
–Pubby
–Triplify
–D2RQ
–Virtuoso RDF Views
Extracting data from the Web (e.g., DBPedia: data extraction from Wikipedia)
RDFizers: from JPEG, , BibTeX, Java bytecode, Javadoc, weather reports, Excel, ... to RDF

DBPedia Figure from [26]

Example Mashup: Revyu.com Try yourself: Picture from revyu.com

Example Mashup: DBPedia Mobile Try yourself: Pictures from DBPedia Mobile

Current Developments: Interlinking Multimedia Recommended literature: [25], [27]

Interlinking Multimedia – The Vision
1. Show me photos of presidents of the European Commission visiting a country in Asia:
–DBpedia: list EC presidents -> [L-EP]
–Geonames: list Asian countries -> [L-AC]
–Google: list photos taken in a country of [L-AC] -> [L-ACP]
–Google: in [L-ACP] find regions that depict members of [L-EP] -> result
2. Give me a summary of all scenes from videos where EC presidents talk with an Asian monarch.
The solution? MM Interlinking as a lightweight bottom-up approach to interlink multimedia.

Interlinking Multimedia – Principles and Requirements
1. To become part of the LOD cloud, the Linked Data principles should be followed.
2. Consider the characteristics of multimedia (e.g. highly subjective semantics) and thus consider provenance (who said what and when?).
3. Metadata descriptions have to be interoperable in order to reference and integrate parts of the described resources.
4. Localizing and identifying fragments is essential in order to link parts of resources with each other.
5. Interlinking methods need to be available, which are essential in order to manually or (semi-)automatically interlink multimedia resources (e.g., automatic interlinking, user-contributed interlinking, emergent interlinking, and game-based interlinking, cf. [27]).

Wrap Up and Take Home
Vision of the “Web of Data”
How to build the “Web of Data”
–Embedding structured information via Microformats and RDFa
–Extracting and generating structured information via GRDDL
–Publishing Linked Data
Outlook: Multimedia in the “Web of Data”

Questions?

Bibliography
[1] V. Bush: “As We May Think”. The Atlantic Monthly, July. Reprint available online.
[2] Hypertext, Wikipedia entry.
[3] Linked Data.
[4] The Programmable Web – Web 2.0 APIs.
[5] Microformats.
[6] RDFa Primer.
[7] Gleaning Resource Descriptions from Dialects of Languages (GRDDL), W3C Recommendation.
[8] J. Allsopp: “Microformats: Empowering Your Markup for Web 2.0”. Friends of ED.
[9] T. Celik and K. Marks: “Real World Semantics”.
[10] RDFa in XHTML: Syntax and Processing, W3C Recommendation.
[11] J. Heflin, J. Hendler, and S. Luke: “Reading between the lines: Using SHOE to discover implicit knowledge from the Web”. In: Proceedings of the AAAI-98 Workshop on AI and Information Integration, Madison, WI, July.
[12] D. Fensel, S. Decker, M. Erdmann, and R. Studer: “Ontobroker: Or how to enable intelligent access to the WWW”. In: Proceedings of the 11th Workshop on Knowledge Acquisition, Modeling and Management, Banff, Canada, April.
[13] Embeddable RDF (eRDF).
[14] Tools. RDFa Wiki.
[15] GRDDL Primer.
[16] Gleaning Resource Descriptions from Dialects of Languages (GRDDL), W3C Recommendation, 11 September 2007.
[17] GRDDL Use Cases.
[18] Yahoo SearchMonkey.
[19] SearchMonkey Guide.
[20] P. Mika: “The Anatomy of a SearchMonkey”. Nodalities Magazine, Sep/Oct. Available online.
[21] T. Berners-Lee: “Linked Data Principles”.
[22] C. Bizer, R. Cyganiak, and T. Heath: “How to Publish Linked Data on the Web”.
[23] M. Hausenblas: “Exploiting Linked Data For Building Web Applications”. IEEE Internet Computing, 2009 (to appear).
[24] Linking Open Data Community Project.
[25] M. Hausenblas, R. Troncy, T. Bürger, and Y. Raimond: “Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia Fragments”. In: Proceedings of Linked Data on the Web 2009 (LDOW2009) (forthcoming).
[26] S. Auer, C. Bizer, G. Kobilarov, J. Lehmann, R. Cyganiak, and Z. Ives: “DBpedia: A Nucleus for a Web of Open Data”. In: Proc. of the 6th International Semantic Web Conference (ISWC).
[27] T. Bürger and M. Hausenblas: “Interlinking Multimedia - Principles and Requirements”. In: Proceedings of the First International Workshop on Interacting with Multimedia Content on the Social Semantic Web, co-located with SAMT 2008, December 3-5, 2008.