Microdata and schema.org

Slides:



Advertisements
Similar presentations
A guide to HTML. Slide 1 HTML: Hypertext Markup Language Pull down View, then Source, to see the HTML code. Slide 1.
Advertisements

Microdata and schema.org. Basics Microdata is a simple semantic markup scheme that’s an alternative to RDFa Microdata Developed by WHATWG and supported.
CS570 Artificial Intelligence Semantic Web & Ontology 2
RDFa: Embedding RDF Knowledge in HTML Some content from a presentation by Ivan Herman of the W3c, Introduction to RDFa, given at the 2011 Semantic Technologies.
MICRODATA IN HTML 5.0 Technologies for Web Application Development Martin Nečaský Department of Software Engineering, Faculty of Mathematics and Physics,
Embedding Knowledge in HTML Some content from a presentations by Ivan Herman of the W3c.
RDFa Lite. RDFa 1.1 Lite is a subset of RDFa 1.1 Five simple attributes: vocab, typeof, property, resource, and prefix Completely upwards compatible RDFa.
Sematic Web Microdata, Microformat and RDF Advanced Web-based Systems | Misbhauddin.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
From SHIQ and RDF to OWL: The Making of a Web Ontology Language
Topics in this presentation: The Web and how it works Difference between Web pages and web sites Web browsers and Web servers HTML purpose and structure.
CS 415 N-Tier Application Development By Umair Ashraf July 16,2013 National University of Computer and Emerging Sciences Lecture # 12 HTML/ XHTML/ HTML5.
Microdata for Dallas County Historical & Genealogical Cemetery Data Tony Hanson Webmaster 1.
HTML5 Rob Larsen htmlcssjavascript.com htmlcssjavascript.com /downloads/html5.ppt.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
HTML 5 New Standardization of HTML. I NTRODUCTION HTML5 is The New HTML Standard, New Elements New Attributes Full CSS3 Support Video and Audio 2D/3D.
Microdata and schema.org. Basics Microdata is a simple semantic markup scheme that’s an alternative to RDFa Microdata Developed by WHATWG and supported.
PRO HTML5 PROGRAMMING POWERFUL APIS FOR RICHER INTERNET APPLICATION DEVELOPMENT.
Aidministrator nederland b.v. Adding formal semantics to the Web Jeen Broekstra, Michel Klein, Stefan Decker, Dieter Fensel,
Chapter 6 Understanding Each Other CSE 431 – Intelligent Agents.
Network of Excellence in Internet Science Network of Excellence in Internet Science (EINS) 2 nd REVIEW Brussels, 4-5 February 2014 FP7-ICT
OWL Capturing Semantic Information using a Standard Web Ontology Language Aditya Kalyanpur Jennifer Jay Banerjee James Hendler Presented By Rami Al-Ghanmi.
 “Micro Data (like RDFa and Microformats) is a form of semantic mark-up designed to describe elements on a web page e.g. review, person, event etc. This.
© Copyright 2008 STI INNSBRUCK HTML Data Guide W3C Interest Group Note 08 March 2012 OC Working Group –
Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 12 This presentation © 2004, MacAvon Media Productions Hypertext and Hypermedia.
Embedding Knowledge in HTML Some content from a presentations by Ivan Herman of the W3c.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
 Structured Data An Introduction to Semantic Web “It is very hard for search engines to understand the structure and semantics of data embedded in an.
PRACTICAL KNOWLEDGE REPRESENTATION FOR THE WEB Frank van Harmelen Dieter Fensel AIFB Kim Kangil Structural Complexity Laboratory.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
©Silberschatz, Korth and Sudarshan10.1Database System Concepts W3C - The World Wide Web Consortium W3C - The World Wide Web Consortium.
RDFa Primer Bridging the Human and Data webs Presented by: Didit ( )
Microdata in HTML 5.0 Web Applications Martin Nečaský Department of Software Engineering, Faculty of Mathematics and Physics, Charles University in Prague,
Let’s look what flash can do: OK, that wasn’t flash! That was HTML5.
Web Design Terminology Unit 2 STEM. 1. Accessibility – a web page or site that address the users limitations or disabilities 2. Active server page (ASP)
HTML5 Structure. Slide 2 Introduction Introduce the purpose of the new HTML5 semantic tags Introduce the HTML5 outlining mode Using semantic metadata.
Semantic Web in Depth Schema.org RDFa, JSON-LD, Microdata Professor Steffen Staab 2016, Many slides courtesy by Dr. Nick Gibbins.
Blended HTML and CSS Fundamentals 3 rd EDITION Tutorial 1 Using HTML to Create Web Pages.
Agenda * What is HTML5 -- Its history and motivation * HTML/XHTML as Human / Machine Readable Format * HTML and its related technologies * Brief summary.
Decentralized Extensibility & HTML 5 (An introduction to the debate) Noah Mendelsohn Distinguished Engineer – IBM Corp. Co-chair: W3C Technical Architecture.
4.01 How Web Pages Work.
4.01 How Web Pages Work.
HTML CS 4640 Programming Languages for Web Applications
Embedding Knowledge in HTML
4.01 How Web Pages Work.
HTML 5.
HTML5 Basics.
Objective % Select and utilize tools to design and develop websites.
Linked Data Web that can be processed by machines
RDFa How and Why Ralph R. Swick World Wide Web Consortium
Experiences and Status
JSON-LD.
Web Engineering.
Objective % Select and utilize tools to design and develop websites.
Embedding Knowledge in HTML
Code Expert-Web design & Development Product by: Codexoxo Source:
Making data discoverable through mainstream search engines
How to publish in a format that enhances literature-based discovery?
RDA cataloguing and linked data
Microdata and schema.org
RDA Community and linked data
Embedding Knowledge in HTML
RDFa: Embedding RDF Knowledge in HTML
JSON for Linked Data: a standard for serializing RDF using JSON
Resource Description Framework (RDF)
4.01 How Web Pages Work.
4.01 How Web Pages Work.
Meaning Matters Semantic Markup.
JSON-LD.
HTML CS 4640 Programming Languages for Web Applications
Presentation transcript:

Microdata and schema.org

Basics Microdata is a simple semantic markup scheme that’s an alternative to RDFa Developed by WHATWG* and supported by major search companies (Google, Microsoft, Yahoo, Yandex) Like RDFa, it uses HTML tag attributes to host metadata Vocabularies are controlled and hosted at schema.org * Web Hypertext Application Technology Working Group

What is WHATWG? Web Hypertext Application Technology Working Group Community interested in evolving the Web with focus on HTML and Web API development Ian Hickson is a key person, now at Google Founded in 2004 by individuals from Apple, Mozilla and Opera after a W3C workshop Concern about W3C's embrace of XHTML Worked on HTML5, developed Microdata spec

HTML5 Started by WHATWG as an alternative to XHTML, joined by W3C HTML5 recommendation, October 2014 HTML5.1 recommendation, November 2016 WHATWG will evolve it as a “living standard” HTML5 ≈ HTML + CSS + js Native support for graphics, video, audio, speech, semantic markup, … Current support in major browsers

Microdata The microdata effort has two parts: A markup scheme A set of vocabularies/ontologies The markup is similar to RDFa in providing ways to identify subjects, types, properties & objects Also a standard way to encode Microdata as RDFa Sanctioned vocabularies at schema.org and include a small number of very useful ones: people, movies, events, recipes, etc.

An example <div> <h1>Avatar</h1> <span>Director: James Cameron (born 1954) </span> <span>Science fiction</span> <a href=”avatar-trailer.html">Trailer</a> </div>

An example: itemscope An itemscope attribute identifies a content subtree that is the subject about which we want to say something <div itemscope > <h1>Avatar</h1> <span>Director: James Cameron (born 1954) </span> <span>Science fiction</span> <a href=”avatar-trailer.html">Trailer</a> </div>

An example: itemtype An itemscope attribute identifies a content subtree that is the subject about which we want to say something The itemtype attribute specifies the subject’s type <div itemscope itemtype="http://schema.org/Movie"> <h1>Avatar</h1> <span>Director: James Cameron (born 1954) </span> <span>Science fiction</span> <a href=”avatar-trailer.html">Trailer</a> </div>

Microdata <-> RDF http://rdf-translator.appspot.com/

Microdata <-> RDF http://rdf-translator.appspot.com/

An example: itemtype An itemscope attribute identifies content subtree that is the subject about which we want to say something The itemtype attribute specifies the subject’s type <div itemscope itemtype="http://schema.org/Movie"> <h1>Avatar</h1> <span>Director: James Cameron (born 1954) </span> <span>Science fiction</span> <a href=”avatar-trailer.html">Trailer</a> </div> [ ] a schema:Movie .

An example: itemprop An itemscope attribute identifies a content subtree that is the subject about which we want to say something The itemtype attribute specifies the subject’s type An itemprop attribute gives a property of that type <div itemscope itemtype="http://schema.org/Movie"> <h1 itemprop="name">Avatar</h1> <span>Director: James Cameron (born 1954) </span> <span itemprop="genre">Science fiction</span> <a href=”avatar-trailer.html” itemprop="trailer">Trailer</a> </div>

An example: itemprop An itemscope attribute identifies a content subtree that is the subject about which we want to say something The itemtype attribute specifies the subject’s type An itemprop attribute gives a property of that type <div itemscope itemtype="http://schema.org/Movie"> <h1 itemprop="name">Avatar</h1> <span>Director: James Cameron (born 1954) </span> <span itemprop="genre">Science fiction</span> <a href=”avatar-trailer.html” itemprop="trailer">Trailer</a> </div> [ ] a schema:Movie ; schema:genre "Science fiction" ; schema:name "Avatar" ; schema:trailer <avatar-trailer.html> .

An example: embedded items An itemprop immediately followed by another itemcope makes the value an object <div itemscope itemtype="http://schema.org/Movie"> <h1 itemprop="name">Avatar</h1> <div itemprop="director" itemscope itemtype="http://schema.org/Person"> Director: <span itemprop="name">James Cameron</span> (born <span itemprop="birthDate">1954</span>) </div> <span itemprop="genre">Science fiction</span> <a href="avatar-trailer.html" itemprop="trailer">Trailer</a>

An example: embedded items [ ] a schema:Movie ; schema:director [ a schema:Person ; schema:birthDate "1954" ; schema:name "James Cameron" ] ; schema:genre "Science fiction" ; schema:name "Avatar" ; schema:trailer <avatar-trailer.html> . An itemprop immediately followed by another itemcope makes the value an object <div itemscope itemtype="http://schema.org/Movie"> <h1 itemprop="name">Avatar</h1> <div itemprop="director" itemscope itemtype="http://schema.org/Person"> Director: <span itemprop="name">James Cameron</span> (born <span itemprop="birthDate">1954</span>) </div> <span itemprop="genre">Science fiction</span> <a href="avatar-trailer.html" itemprop="trailer">Trailer</a>

schema.org vocabulary Full type hierarchy in one file 590 classes, >700 properties (Nov ‘16) Data types: Boolean, Date, DateTime, Number, Text, Time Objects: Rooted at Thing with two ‘metaclasses’ (Class and Property) and eight subclasses See github repo for examples and code

http://www.schema.org/Recipe

Testing Structured Data in HTML

Testing Structured Data in HTML

Testing Structured Data in HTML

Microdata as a KR language More than RDF, less than RDFS Properties have an expected type (range) Can be a list of types, any of which are OK Might be a string for many properties (“some data better than none”) Properties attached ≥ 1 types (domain) Classes can have multiple parents and inherit (properties) from all of them No axioms (e.g., disjointness, cardinality, etc.) No subPropertyOf like relation

Mixing vocabularies Microdata is intended to work with just one vocabulary: the one at schema.org Advantages Simple, organized, well designed Controlled by the schema.org people Disadvantages: too simple, controlled Too simple, narrow, mono-lingual

Extending schema.org ontology Extensions: hosted vs. external Hosted: managed & published by schema.org project You can subclass existing classes Person/Engineer Person/Engineer/ElectricalEngineer Subclass exisiting properties musicGroupMember/leadVocalist musicGroupMember/leadGuitar1 musicGroupMember/leadGuitar2

Extension Problems Hard to establish agreed upon meaning Through axioms supported by the language (e.g., equivalence, disjointness, etc.) No place for documentation (annotations, labels, comments) Without a namespace mechanism, your Person/Engineer and mine can be confused and might mean different things

Serialization Schema.org has a data model and serializations Microdata is the original, native sterilization RDFa is more expressive and works with the RDF stack Everyone agrees that RDFa Lite is a good encoding: as simple as Microdata but more expressive JSON-LD is an increasingly popular accepted encoding Search engines look for Microdata, RDFa and JSON-LD Schema.org considers RDFa to be the “canonical machine representation of schema.org”

Conclusions Microdata is an effort by a group of search companies to use a simple semantic language The semantics is pragmatic e.g., expected types: a string is accepted where a thing is expected – “some data is better than none” The real value is in the supported vocabularies and their use by Search companies => Immediate motivation for using semantic markup