Bringing The IPTC News Architecture into the Semantic Web

Slides:



Advertisements
Similar presentations
Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.
Advertisements

Improving Learning Object Description Mechanisms to Support an Integrated Framework for Ubiquitous Learning Scenarios María Felisa Verdejo Carlos Celorrio.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
KOM, SEKE, June 20, 2004 Representing Chains of Custody Along a Forensic Process: A Case Study on Kruse Model Tamer Fares Gayed, UQAM Hakim Lounis, UQAM.
MPEG-7 based Multimedia Ontologies: Interoperability Support or Interoperability Issue? Wednesday 5 th of December, 2007 Oscar CelmaRapha.
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
Planning for Flexible Integration via Service-Oriented Architecture (SOA) APSR Forum – The Well-Integrated Repository Sydney, Australia February 2006 Sandy.
Bringing NewsML2 into the Semantic Web Raphaël Troncy George Anadiotis Passepartout.
CSCI 572 Project Presentation Mohsen Taheriyan Semantic Search on FOAF profiles.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Flink: Lessons of interoperability Peter Mika Dept. of Business Informatics Free University Amsterdam 1 st Intl. Workshop on.
LINKED DATA COMS E6125 Prof. Gail Kaiser Presented By : Mandar Mohe ( msm2181 )
Semantic Web Presented by: Edward Cheng Wayne Choi Tony Deng Peter Kuc-Pittet Anita Yong.
MUSCLE WP9 E-Team Integration of structural and semantic models for multimedia metadata management Aims: (Semi-)automatic MM metadata specification process.
Networking Session: Global Information Structures for Science & Cultural Heritage - The Interoperability Challenge «INTEROPERABILITY FROM THE CULTURAL.
Linking Disparate Datasets of the Earth Sciences with the SemantEco Annotator Session: Managing Ecological Data for Effective Use and Reuse Patrice Seyed.
1/ 27 The Agriculture Ontology Service Initiative APAN Conference 20 July 2006 Singapore.
Context and Prosopography: Putting the 'Archives' Into LOD-LAM Corey A Harper SAA MDOR
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
RDA and Linking Library Data VuStuff III Conference Villanova University, Villanova, PA October 18, 2012 Dr. Sharon Yang Rider University.
Break Out Session on Infrastructure and Technology: A Report Vipul Kashyap AOS Workshop, Rome, 15 November 2001
The Earth System Curator Metadata Representations Prototype Portal in Collaboration with ESMF and ESG Rocky Dunlap Spencer Rugaber Georgia Tech.
Integrating Structure and Semantics into Audio-visual Documents Tuesday 21 st of October, 2003 Raphaël Troncy 2nd International Semantic Web Conference.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
Information Systems & Semantic Web University of Koblenz ▪ Landau, Germany Semantic Web - Multimedia Annotation – Steffen Staab
Boris Villazón-Terrazas, Ghislain Atemezing FI, UPM, EURECOM, Introduction to Linked Data.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Modeling and Representing National Climate Assessment Information using Linked Data Jin Guang Zheng 1 Curt Tilmes 2
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Semantic Web Exam 1 Review.
© Copyright 2013 STI INNSBRUCK “How to put an annotation in HTML?” Ioannis Stavrakantonakis.
Shridhar Bhalerao CMSC 601 Finding Implicit Relations in the Semantic Web.
The Mint Mapping tool The MoRe aggregator Vassilis Tzouvaras, Dimitris Gavrilis National Technical University of Athens Digital Curation Unit - IMIS, Athena.
ELIS – Multimedia Lab PREMIS OWL Sam Coppens Multimedia Lab Department of Electronics and Information Systems Faculty of Engineering Ghent University.
1cs The Need “Most of the Web's content today is designed for humans to read, not for computer programs to manipulate meaningfully.” Berners-Lee,
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
COMM: Designing a Well-Founded Multimedia Ontology for the Web Wednesday 14 th of November, 2007 Richard Arndt Steffen Staab Rapha.
PREMIS Controlled vocabularies Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair Vienna,
RDFa Primer Bridging the Human and Data webs Presented by: Didit ( )
Linked Library (+AM) Data Presented LITA Next-Generation Catalog IG Corey A Harper Publish, Enrich, Relate and Un-Silo.
INHA UNIVERSITY, KOREA Rainer Simon Austrian Institute of Technology.
Working meeting of WP4 Task WP4.1
Linked Data Web that can be processed by machines
Introduction to Persistent Identifiers
Resource Description Framework
Yaşar Tonta & Orçun Madran [yasartonta, Hacettepe University
Data.gov: Web, Data Web, Social Data Web 7/22/2010 #health2stat.
Integrating Data for Archaeology
Middleware independent Information Service
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Analyzing and Securing Social Networks
Applications of IFLA Namespaces
NKOS workshop Alicante, 2006
Vocabularies and Semantics
PREMIS Tools and Services
2. An overview of SDMX (What is SDMX? Part I)
2. An overview of SDMX (What is SDMX? Part I)
Searching and browsing through fragments of TED Talks
RDF Standard Data Model Exchange
Semantic Annotation service
LOD reference architecture
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
Visual recall of class information
Semantic Statistics DDI Lifecycle: Moving Forward Outcome of the Recent Workshops in Dagstuhl Joachim Wackerow.
Taxonomy of public services
Taxonomy of public services
Australian and New Zealand Metadata Working Group
Presentation transcript:

Bringing The IPTC News Architecture into the Semantic Web Raphaël Troncy, <Raphael.Troncy@cwi.nl> CWI, Semantic Media Interfaces ISWC 2008: Wednesday, 29 October 2008 1

videos cartoons ISWC 2008: Wednesday, 29 October 2008

animations blogs ISWC 2008: Wednesday, 29 October 2008

News Workflow Interoperability No integration of media (stories, photo, animation, video) Little (or no) context in the news presentation Lack of interoperability in the current workflow NAR Schema Broadcaster Schema User Vocabulary NewsCodes Controlled Vocabularies ISWC 2008: Wednesday, 29 October 2008 ISWC 2008: Wednesday, 29 October 2008 4 4

Metadata is Key (Ultimate) Goal: Required integration: Provide an environment for searching and browsing contextualized multimedia news information Required integration: Data: various media, different forms, various sources Metadata: schema integration, semantic models Influence and implications of UI: How to represent semantic multimedia metadata to facilitate presenting information? in other words ... What constraints do end-user interfaces put on the modeling of the metadata? ISWC 2008: Wednesday, 29 October 2008 ISWC 2008: Wednesday, 29 October 2008 5

News and Multimedia Formats News Architecture NewsML G2 EventsML SportsML (NAR) ISWC 2008: Wednesday, 29 October 2008

Porting Schemas and Thesauri to the Semantic Web Methodologies and tools for building ontologies: ... from scratch ʺSKOSificationʺ of thesauri in the CH domain: preparation, syntactic and semantic conversion, standardization  Lack of best practices for modeling ontologies from UML diagrams, integrating ontologies with various thesauri, while taking the end-user interface into account ISWC 2008: Wednesday, 29 October 2008 ISWC 2008: Wednesday, 29 October 2008 7

Building a Semantic Web Infrastructure for News 1 2 3 4 Modeling the NAR ontology Linking with media ontologies Building SKOS thesauri Enriching the metadata ISWC 2008: Wednesday, 29 October 2008 ISWC 2008: Wednesday, 29 October 2008 8

Step 1: Modeling the NAR Ontology  focus on reuse of XML types leading to multiple repetition resulting in overly complex nested XML structures ISWC 2008: Wednesday, 29 October 2008

Step 1: Modeling the NAR Ontology Flattening the XML structure NewsItem PhotoNewsItem ISWC 2008: Wednesday, 29 October 2008

Step 1: Modeling the NAR Ontology Modeling unique identifiers Use of dereferencable URIs for any resources (news items + vocabularies) Future: Use of URIs for resource fragments http://www.youtube.com/watch?v=1bibCui3lFM#t=1m45s Modeling the provenance of the information Reification Named (and Networked) Graphs {<> nar:subject cat:11002000} dc:creator team:md ; dc:modified ‘‘2005-11-11T08:00:00Z’’. ISWC 2008: Wednesday, 29 October 2008

Step 2: Linking with Media Ontologies dc:Subject ≈ nar:Subject foaf:Person ≈ nar:Person sioc:Item ≈ nar:Item + geo:lat geo:long ISWC 2008: Wednesday, 29 October 2008

Step 3: Getting SKOS Vocabularies ISWC 2008: Wednesday, 29 October 2008

Step 3: Getting SKOS Vocabularies ISWC 2008: Wednesday, 29 October 2008

Step 4: Enriching the News Metadata Concepts/Entities that are subject of news Thematic categories People Organizations Geopolitical Areas Points of Interest Events Products or artefacts © IPTC – www.iptc.org

Step 4: Enriching the News Metadata Named Entity Recognition Domain Ontologies NAR Ontology NewsCodes Thesaurus ISWC 2008: Wednesday, 29 October 2008

Step 4: Enriching the News Metadata Concept Detectors Domain Ontologies NAR Ontology NewsCodes Thesaurus ISWC 2008: Wednesday, 29 October 2008

Web of Data and Linked Data wp:2006_FIFA_Wolrd_Cup#Final nc:15054000 nar:subject events:id nar:location foaf:depicts geonames:2950159 dbpedia:Zidane ISWC 2008: Wednesday, 29 October 2008

Presenting News Information Dimensions used for searching news items When time 10/07/2006 Where location Paris What is depicted J. Chirac, Z. Zidane Why event WC 2006 Who photographer Bertrand Guay, AFP Metadata ISWC 2008: Wednesday, 29 October 2008

Semantic Search of Multimedia News Description Number of RDF Triples General Ontologies: NAR, DC, FOAF 7,336 Domain Specific Ontologies: football 104,358 Thesauri: newscodes 34,903 DBpedia, Geonames 53,468 AFP News Feed (June/July 2006) 804,446 AFP Photos (June/July 2006) 61,311 INA Broadcast Video (June/July 2006) 1,932 Total 1,067,754 Powered by ClioPatria 1.0 alpha 3 ISWC 2008: Wednesday, 29 October 2008

ISWC 2008: Wednesday, 29 October 2008

Conclusion 4-Steps methodology for building an ontology-based news infrastructure UML-2-OWL: Flatten XML structure, Identify all resources SKOS-ify existing thesauri and use the Web of Data Reuse what is there ... Expose what you make Enrich metadata with text and visual analysis Provide new dimensions (facets) for browsing the data Ex: distinguish field images vs stadium and street images with a grass detector for the World Cup dataset ISWC 2008: Wednesday, 29 October 2008

ISWC 2008: Wednesday, 29 October 2008

Future Work Data Modeling End-user Interfaces Data Quality Events Model End-user Interfaces Yahoo! Search BOSS Data Quality Named Entity Recognition (Calais), Disambiguation algorithms (SemanticProxy), Visual clustering, Video segmentation ISWC 2008: Wednesday, 29 October 2008

Credits Datasets: People: More info: http://newsml.cwi.nl ISWC 2008: Wednesday, 29 October 2008

ISWC 2008: Wednesday, 29 October 2008

ISWC 2008: Wednesday, 29 October 2008

ISWC 2008: Wednesday, 29 October 2008

ISWC 2008: Wednesday, 29 October 2008

ISWC 2008: Wednesday, 29 October 2008