Bringing NewsML2 into the Semantic Web
Raphaël Troncy, George Anadiotis
Passepartout



6/3/2015, 41st IPTC Annual General Meeting

Slide 2: Why Bother with Metadata?
- A news agency is a content provider
- Content (stories, photos, video, etc.) are assets
- Metadata add value to these assets by providing human- and machine-readable information about them
- Metadata is much more than a bunch of keywords added at the end of the chain so the customer can find your image
- Metadata covers all information about an asset, which enables machines to do smart things with your assets

Slide 3: Why Bother with Semantics?
High-quality semantic multimedia metadata enables:
- Easy exchange of news items
- Semantic search of particular news items
- Delivery of personalized news content to customers
  - Interactive browsing in a news archive
  - Cross-modality: packaging the news stories, photos, graphics, audio, videos
  - For different end-user platforms (mobiles, PCs, handhelds, etc.)

Slide 4: IPTC Metadata Standards
Metadata "fields"
- Informal definition and guidelines for using the field according to its semantics
- e.g. "Date Created": content creation date ≠ digital representation creation date

Slide 5: IPTC Metadata Standards
Metadata "values"
- Expressed as controlled vocabularies (standardization bodies)
- A vocabulary is composed of terms (flat list, taxonomy organization)
- IPTC has defined 28 sets of multilingual NewsCodes
  - NewsCodes use numeric strings = language agnostic
  - Ex: Subject ≈ 1300 terms, 3-level hierarchy in 4 languages
  - NewsCodes Viewer application (View)
XML wrapper
- Metadata embedded in a photo: XMP
- Metadata stored in a separate file: NewsML
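
Note: a minimal Python sketch of why numeric codes are language agnostic and how a three-level hierarchy can be read off the code itself. It assumes an 8-digit code split as 2 + 3 + 3 digits (one group per level); that layout, the sample codes, and the labels are illustrative assumptions, not values taken from the slides.

```python
# Hypothetical label table: one numeric code, labels in several languages.
# Codes and labels are invented for illustration; they are not real NewsCodes.
LABELS = {
    "99000000": {"en": "example subject", "fr": "sujet exemple"},
    "99001000": {"en": "example subject matter", "fr": "matière exemple"},
    "99001002": {"en": "example subject detail", "fr": "détail exemple"},
}


def ancestors(code):
    """Return the broader codes of an 8-digit code, nearest first.

    Assumes a 2 + 3 + 3 digit layout: level 1, level 2, level 3.
    """
    if len(code) != 8 or not code.isdigit():
        raise ValueError("expected an 8-digit numeric code: %r" % code)
    chain = []
    if code[5:] != "000":          # level-3 code: parent is the level-2 code
        chain.append(code[:5] + "000")
    if code[2:5] != "000":         # level-2 (or deeper): add the level-1 code
        chain.append(code[:2] + "000000")
    elif code[5:] != "000":        # level-3 directly under level 1
        chain[-1] = code[:2] + "000000"
    return chain


def label(code, lang):
    """Look up the label of a code in the requested language."""
    return LABELS[code][lang]
```

The same code resolves to a different label per language, while the hierarchy is computed from the digits alone: `ancestors("99001002")` gives the level-2 and level-1 codes without consulting any label.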

Slide 6: Problem: XML and Semantics *)
(XML example; surviving fragments: references, testimonial, subject, company, presentation, </Testimonial...)
=> Need for formal semantics for the content
*) adapted from Frank van Harmelen

Slide 7: Problem: Interoperability
- Different management applications may label the same field differently
  - e.g. Creator / By-Line (Author) / Author / By-Line
- The informal semantics (guidelines) of the various metadata fields prevent automatic validation of their use
=> Need for formal semantics for the structure
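
Note: the labelling problem can be made concrete with a small Python sketch. The alias table below is a hand-written stand-in for what an ontology would state formally (for example as equivalence or sub-property axioms); the table contents and the `normalize` helper are hypothetical and only illustrate the idea.

```python
# Hypothetical alias table: labels used by different applications for one field.
CANONICAL = {
    "creator": "creator",
    "by-line": "creator",
    "by-line (author)": "creator",
    "author": "creator",
}


def normalize(record):
    """Map application-specific field labels onto canonical property names.

    Raises KeyError for a label with no known mapping, which is the kind of
    automatic validation that informal guidelines alone cannot provide.
    """
    out = {}
    for field, value in record.items():
        key = CANONICAL.get(field.strip().lower())
        if key is None:
            raise KeyError("unmapped field label: %r" % field)
        out.setdefault(key, []).append(value)
    return out
```

With such a mapping, `normalize({"By-Line": "A. Writer"})` and `normalize({"Author": "A. Writer"})` yield the same record, so two applications can exchange the field without agreeing on its label.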

Slide 8: Role of the Semantic Web
- "Oh no! Not yet another metadata standard!"
- Like we don't have enough of them already: EXIF, Dublin Core, VRA Core, IPTC Core, XMP, MPEG-7, Creative Commons, ...?
- But again:
  - No single standard can cover all metadata needs
  - SW is a framework that could make existing metadata standards and tools interoperable
  - ... and make them interoperable with the rest of the Web!

Slide 9: NewsML2 and the SW
Common basis
- Distributed resources (news items) globally and uniquely identified => URI
- Use of shared and controlled vocabularies
Natural switch and numerous benefits
- Better control of NewsML2 descriptions (logical consistency check)
- Enhanced search of news topics (logical inferences)
- Intelligent presentation – Semantic interfaces
- Unified news management – Semantic CMS

Slide 10: Use Case Scenario
Q: News about the leader of Nepal?
King
"Nepal's King Gyanendra attended a Hindu festival in Kathmandu, his first public appearance since being stripped of most of his powers by parliament last month. ..."

Slide 11: Use Case Scenario
Q: News about the leader of Nepal?
- King Gyanendra of Nepal
- Prime Minister Girija Prasad Koirala

Slide 12: What Have We Done?
- Creation of a News domain ontology in OWL
- Based on the UML model specifications of NewsML2
- Online conversion service
- Mapping of the IPTC NewsCodes into various SKOS thesauri
- Dynamically transforming NewsML2 (XML) descriptions into their equivalent RDF counterparts
  - Using the NewsML ontology
  - Linking to the SKOS IPTC NewsCodes
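
Note: a minimal Python sketch of what an XML-to-RDF conversion step might look like. The XML below is a simplified, made-up stand-in (real NewsML2 documents are richer and namespaced), and both namespace URIs are hypothetical. This is not the conversion service from the talk, only an illustration of turning element content into triples and linking subject codes to thesaurus concepts.

```python
import xml.etree.ElementTree as ET

# Simplified, hypothetical input; element names, code, and values are invented.
SAMPLE = """
<newsItem guid="urn:example:item:1">
  <headline>Strike in New Caledonia</headline>
  <creator>Example Agency</creator>
  <subject code="99001000"/>
</newsItem>
"""

NEWS = "http://example.org/newsml#"       # hypothetical ontology namespace
CODES = "http://example.org/newscodes/"   # hypothetical SKOS concept namespace


def to_triples(xml_text):
    """Convert one news item into (subject, predicate, object) triples.

    Subject codes become links to concept URIs; other child elements become
    plain literal values of a property named after the element.
    """
    item = ET.fromstring(xml_text)
    subject = item.get("guid")
    triples = []
    for child in item:
        if child.tag == "subject":
            # Link to the thesaurus concept instead of copying the raw code.
            triples.append((subject, NEWS + "subject", CODES + child.get("code")))
        else:
            triples.append((subject, NEWS + child.tag, (child.text or "").strip()))
    return triples


for triple in to_triples(SAMPLE):
    print(triple)
```

The design choice worth noting is that the subject code is emitted as a URI rather than a string, so later queries can follow it into the thesaurus hierarchy.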

Slide 13: What Is the Added Value?
Example: a "normal" day at AFP
Dataset
- 200 NewsML2 stories, 35 photos (original size + thumbnails) + 35 NewsML2 descriptions
- Covering various subjects:
  - A military drill for dealing with contaminations (toxic, nuclear or biological) - Photo
  - A regular meeting of the French cabinet - Photo
  - A strike in New Caledonia - Photo
  - A protest made on the Arch of Triumph in Paris, related to the Iran nuclear crisis - Photo
  - A wine makers protest - Photo
  - A meeting between the French president and Israeli prime minister - Photo
  - A senator's publicity pictures - Photo

Slide 14: Example 1: Reasoning on the Content
Find all related news about "Nuclear"
Nuclear / Nucléaire
- Military drill (NBC)
- Arc de Triomphe protest
- Iran nuclear crisis
- Chirac – Olmert summit
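
Note: one way such a content query can work is to follow broader-concept links in the thesaurus, so that an item tagged with a narrow concept is still found under a general one. The Python sketch below assumes a tiny invented thesaurus and invented item annotations; the concept names are hypothetical and do not reproduce the actual NewsCodes or the AFP dataset.

```python
# Hypothetical thesaurus: concept -> its direct broader concept (skos:broader).
BROADER = {
    "nuclear-weapons": "nuclear",
    "nbc-drill": "nuclear-weapons",
    "nuclear-policy": "nuclear",
    "wine": "agriculture",
}

# Hypothetical annotations: news item -> concepts it is tagged with.
ITEMS = {
    "military-drill": {"nbc-drill"},
    "iran-crisis": {"nuclear-policy"},
    "wine-protest": {"wine"},
}


def falls_under(concept, target):
    """True if `concept` equals `target` or reaches it via broader links."""
    seen = set()
    while concept is not None and concept not in seen:
        if concept == target:
            return True
        seen.add(concept)              # guard against cycles in the thesaurus
        concept = BROADER.get(concept)
    return False


def search(target):
    """Return, sorted, all items tagged with `target` or a narrower concept."""
    return sorted(
        item
        for item, tags in ITEMS.items()
        if any(falls_under(tag, target) for tag in tags)
    )
```

Here `search("nuclear")` finds both the drill and the crisis item although neither is tagged with "nuclear" directly, which a plain keyword match on the tag would miss.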

Slide 15: Example 2: Reasoning on the Structure
Find photos of Y for which the author is X?
What does the NewsML ontology provide?
- slugline and headline are metadata properties, whose values are Basic Components
- creator and contributor are authors
- history of the description (versioning)
No need to know the NewsML structure to answer the query
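
Note: the statement "creator and contributor are authors" can be read as a sub-property relation. The Python sketch below shows, under that assumption, how a query on "author" can be answered without knowing which specific property a description used. The property names, triples, and helper functions are hypothetical illustrations, not the NewsML ontology itself.

```python
# Hypothetical schema: property -> its super-property (rdfs:subPropertyOf).
SUBPROPERTY_OF = {"creator": "author", "contributor": "author"}

# Hypothetical data: (photo, property, person).
TRIPLES = [
    ("photo1", "creator", "X"),
    ("photo2", "contributor", "X"),
    ("photo3", "creator", "Z"),
]


def infer(triples):
    """Add (s, super, o) for every (s, p, o) whose property has a super-property."""
    inferred = set(triples)
    frontier = list(triples)
    while frontier:
        s, p, o = frontier.pop()
        sup = SUBPROPERTY_OF.get(p)
        if sup is not None and (s, sup, o) not in inferred:
            inferred.add((s, sup, o))
            frontier.append((s, sup, o))   # follow longer sub-property chains
    return inferred


def photos_by(author, triples):
    """Return, sorted, every photo whose inferred author is `author`."""
    return sorted(s for s, p, o in infer(triples) if p == "author" and o == author)
```

The query only mentions "author"; `photos_by("X", TRIPLES)` returns both photo1 and photo2 even though one was described with creator and the other with contributor.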

Slide 16: What to Do with the RDF Data?
Various tools are able to digest RDF data and provide a unified view of these data
- FOAF Viewer
- SIMILE project
- /facet: A Browser for Heterogeneous Semantic Web Repositories
  - Faceted browser paradigm (Flamenco)
  - Provide a view on any RDF dataset
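
Note: the faceted-browsing idea can be sketched in a few lines of Python: every property becomes a facet, and each facet lists its values with the number of matching resources. The triples below are invented, and this is only an illustration of the paradigm, not how /facet or any other listed tool is implemented.

```python
from collections import Counter, defaultdict

# Hypothetical RDF-like data: (resource, property, value).
TRIPLES = [
    ("item1", "subject", "politics"),
    ("item1", "type", "story"),
    ("item2", "subject", "politics"),
    ("item2", "type", "photo"),
    ("item3", "subject", "labour"),
    ("item3", "type", "photo"),
]


def facets(triples):
    """Group values by property and count how many triples carry each value."""
    counts = defaultdict(Counter)
    for _resource, prop, value in triples:
        counts[prop][value] += 1
    return counts


def select(triples, prop, value):
    """Keep only the triples of resources that match one facet selection."""
    keep = {s for s, p, o in triples if p == prop and o == value}
    return [t for t in triples if t[0] in keep]
```

Because the facets are derived from whatever properties occur in the data, the same two functions work on any set of triples; calling `facets(select(TRIPLES, "type", "photo"))` narrows the remaining facet counts to photos only.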

Slide 17: W3C Multimedia Semantics Incubator Group

Slide 18: W3C Multimedia Semantics Incubator Group
- Light-weight, one-year (May 2007) group looking at image and other multimedia metadata on the Web
- Focus on interoperability with existing standards
- You can help and shape the future of multimedia metadata on the Web!
- We need your input:
  - Provide use cases & examples
  - Tell us which standards are most important to you (we'll have to prioritize)
  - Review the notes we write

Slide 19: How Can You Help?
- Keep an eye on our web page
- Become active on our mailing list (everything we do is public!)
  - Send us your comments, use cases, examples, priorities, etc.
  - Don't be shy!
- Join us and get the right to vote on all our decisions
  - Free for all W3C member organizations
  - On invitation for non-members

Slide 20: Conclusion
- Methods and conversion tools for bringing NewsML into the SW (RDF-compliant)
- Added value:
  - Enhance search of news items (logical inferences on the structure and the content)
  - Enhance presentation of news items
    - Semantic media interfaces
    - Discover relations between Items / Topics / Packages
  - Semantic Content Management System
    - Keep track of provenance information

Slide 21: Future Work
- Making the use case scenario REAL!
  - Needs data: photos, videos, graphics, audio, textual stories! (World Cup news preferred :-)
- Implement interfaces for:
  - Browsing a news archive
  - Rendering the search results
- Establishing links between NewsML and other vocabularies
  - IPTC NewsCodes versus domain ontologies
  - NewsML versus DC, EXIF, MPEG-7, etc.

Slide 22: Semantic Media Interfaces

Slide 23: NewsCodeViewer

Slide 24: Myths about the Semantic Web *)
1. "SW people try to enforce meaning from the top"
   - They only recommend languages that you can use to define your concepts according to your definitions
2. "SW people will require everybody to subscribe to a single predefined 'meaning' for the terms we use"
   - You can use these languages to relate existing concepts (bridging communities)
3. "The SW will require users to understand the complicated details of formalized knowledge representation"
   - All of this is 'under the hood'
4. "SW people will require us to manually annotate all the existing web-pages"
   - SW languages can be used to exchange manually and automatically produced metadata
*) adapted from Frank van Harmelen, WWW2006 panel "Meaning on the Web: Evolution or Intelligent Design?"