Formalization of documentary knowledge and conceptual knowledge with ontologies : applying to the description of audio-visual documents Friday 23 rd of.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

AVATAR: Advanced Telematic Search of Audivisual Contents by Semantic Reasoning Yolanda Blanco Fernández Department of Telematic Engineering University.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Using XSLT for Interoperability: DOE and The Traveling Domain Experiment Monday 20 th of October, 2003 Antoine Isaac, Raphaël Troncy and Véronique Malaisé.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
Logics for Data and Knowledge Representation Projects and thesis introduction.
MPEG-7 based Multimedia Ontologies: Interoperability Support or Interoperability Issue? Wednesday 5 th of December, 2007 Oscar CelmaRapha.
Multimedia Semantic Web and MPEG-7 Ana B. Benitez ee.columbia.edu Image and Advanced Television Lab (ADVENT) Department of Electrical Engineering.
DL:Lesson 11 Multimedia Search Luca Dini
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Ontology Notes are from:
PR-OWL: A Framework for Probabilistic Ontologies by Paulo C. G. COSTA, Kathryn B. LASKEY George Mason University presented by Thomas Packer 1PR-OWL.
3. Technical and administrative metadata standards Metadata Standards and Applications.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
Philips Research France Delivery Context in MPEG-21 Sylvain Devillers Philips Research France Anthony Vetro Mitsubishi Electric Research Laboratories.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
MPEG-7 Multimedia Content Description Standard January 8, 2003 John R. Smith Pervasive Media Management Group IBM T. J. Watson Research Center 19 Skyline.
OIL: An Ontology Infrastructure for the Semantic Web D. Fensel, F. van Harmelen, I. Horrocks, D. L. McGuinness, P. F. Patel-Schneider Presenter: Cristina.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Structured Media for Media Integration & Document Authoring Tien TRAN_THUONG and Cécile ROISIN Project OPERA - INRIA Grenoble - France.
New trends in Semantic Web Cagliari, December, 2nd, 2004 Using Standards in e-Learning Claude Moulin UMR CNRS 6599 Heudiasyc University of Compiègne (France)
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Towards Translating between XML and WSML based on mappings between.
A Motivating Scenario for Designing an Extensible Audio- Visual Description Language Monday 25 th of October, 2004 Raphaël Troncy, Jean Carrive, Steffen.
Integrating Structure and Semantics into Audio-visual Documents Tuesday 21 st of October, 2003 Raphaël Troncy 2nd International Semantic Web Conference.
The MPEG-7 Standard - A Brief Tutorial - Ali Tabatabai Sony US Research Laboratories February 27, 2001.
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
Logics for Data and Knowledge Representation
A Proposal for a Video Modeling for Composing Multimedia Document Cécile ROISIN - Tien TRAN_THUONG - Lionel VILLARD Presented by: Tien TRAN THUONG Project.
Information Systems & Semantic Web University of Koblenz ▪ Landau, Germany Semantic Web - Multimedia Annotation – Steffen Staab
WebODE and its Ontology Management APIs. April 8th © Ontology Engineering Group WebODE and its Ontology Management APIs Ontology Engineering Group.
MPEG-7 Interoperability Use Case. Motivation MPEG-7: set of standardized tools for describing multimedia content at different abstraction levels Implemented.
Semantic Commitment for Designing Ontologies: a Proposal Bruno Bachimont Raphaël Troncy Antoine Isaac Institut National de l’Audiovisuel France.
CORPORUM-OntoExtract Ontology Extraction Tool Author: Robert Engels Company: CognIT a.s.
Semantic Web - an introduction By Daniel Wu (danielwujr)
Description of some multimedia ontologies Rapha ë l Troncy Thursday 1 st of December, 2005.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Metadata Schema for CERIF Andrei Lopatenko Vienna University of Technology
Integration of Domain & Application Knowledge in MPEG-7/21 in the DS-MIRF Framework Laboratory of Distributed Multimedia Information Systems & Applications.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
1 Class exercise II: Use Case Implementation Deborah McGuinness and Peter Fox CSCI Week 8, October 20, 2008.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
CS621 : Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 12 RDF, OWL, Minimax.
The Semantic Web and Ontology. The Semantic Web WWW: –syntactic transmission of information –only processible by human – no semantic conservation of the.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
MPEG-7 Audio Overview Ichiro Fujinaga MUMT 611 McGill University.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
COMM: Designing a Well-Founded Multimedia Ontology for the Web Wednesday 14 th of November, 2007 Richard Arndt Steffen Staab Rapha.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
WonderWeb. Ontology Infrastructure for the Semantic Web. IST Project Review Meeting, 11 th March, WP2: Tools Raphael Volz Universität.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
A Reduced Yet Extensible Audio- Visual Description Language: How to Escape From The MPEG-7 Bottleneck Thursday 28 th of October, 2004 Raphaël Troncy, Jean.
Designing and Using an Audio-Visual Description Core Ontology Friday 8 th of October, 2004 Antoine Isaac & Raphaël Troncy.
Ontology Technology applied to Catalogues Paul Kopp.
Introduction to MPEG  Moving Pictures Experts Group,  Geneva based working group under the ISO/IEC standards.  In charge of developing standards for.
Semantic and geographic information system for MCDA: review and user interface building Christophe PAOLI*, Pascal OBERTI**, Marie-Laure NIVET* University.
Mechanisms for Requirements Driven Component Selection and Design Automation 최경석.
Working meeting of WP4 Task WP4.1
DOMAIN ONTOLOGY DESIGN
ece 627 intelligent web: ontology and beyond
Multimedia Content Description Interface
Ontology-Based Approaches to Data Integration
Presentation transcript:

Formalization of documentary knowledge and conceptual knowledge with ontologies : applying to the description of audio-visual documents Friday 23 rd of April, 2004 Raphaël Troncy

23/04/2004 CWI Talk - Raphaël Troncy1 Background The audio-visual document : some peculiarities –structured –spatio-temporal –composed of images The digital audio-visual document : –allow new possibilities : « intelligent » search AV library structuration publication and broadcasting –need for an hyper-linked description: the content has to be linked with the description use of a textual description

23/04/2004 CWI Talk - Raphaël Troncy2 Plan of this talk 1.Problems 2.Document engineering vs. knowledge representation 3.Our proposal: an architecture for reasoning on descriptions of video documents 4.Experimentations 5.Conclusion and future work

23/04/2004 CWI Talk - Raphaël Troncy3 Description of the AV content A three step process : –identification of the content creator and the content provider : Dublin Core metadata, VRA core categories … –structural decomposition in video segments corresponding to the logical structure of the program : time-code, spatial coordinates –semantic description of these segments : controlled vocabulary, thesaurus, free text annotation 1. Problems 2. Document engineering vs. KR 3. Architecture proposal 4. Experimentations 5. Conclusion and future work

23/04/2004 CWI Talk - Raphaël Troncy4 Description of the AV content Segmentation –locate and date some events Description –characterize each segment with an AV genre –characterize each segment with a general thematic –describe the scene (who, when, where, what, …) describe the logical structure describe the semantics of the content 1. Problems 2. Document engineering vs. KR 3. Architecture proposal 4. Experimentations 5. Conclusion and future work

23/04/2004 CWI Talk - Raphaël Troncy5 Example Q : Find all AV sequences of type interview with Sandy Casar and concerning the Paris-Nice cycling race –noise answer: there are other sports news in the sequence –incomplete answer: the interview was broadcasted in two parts and began in a previous sequence –the query cannot be extended ! 13 [Indoor Set: 6 th part] at 18:43:56: :09:06:00. – Eurosport In studio, the second part of the interview, from Nice, of Sandy CASAR by Jean René GODART about the Paris-Nice cycling race and a few sports news with pictures commented by Alexandre BOYON and Laurent PUYAT. Q : Find all AV sequences of type dialog sequence with a rider and concerning any cycling race with several stages 1. Problems 2. Document engineering vs. KR 3. Architecture proposal 4. Experimentations 5. Conclusion and future work

23/04/2004 CWI Talk - Raphaël Troncy6 Requirements : –express models that constrain the logical structure identify an interview inside a report of a sports magazine –represent the meaning contained in this structure a cartoon is a fiction with no real characters –describe semantically the content of each sequence the Prologue is always an individual time trial numbered stage 0  Which languages are the most suitable to perform all these tasks ?  What kind of knowledge do we need ? Problems Weak use of the logical structures Descriptions are not made for reasoning  make the AV descriptions accessible to automated processes 1. Problems 2. Document engineering vs. KR 3. Architecture proposal 4. Experimentations 5. Conclusion and future work

23/04/2004 CWI Talk - Raphaël Troncy7 Document engineering Provide models, languages and tools for managing document libraries Encode both structured documents and structured data: XML [W3C, 1998] & XML Schema [W3C, 2001] Distinguish the content from its presentation –Languages for presenting multimedia documents : SMIL –Models for describing multimedia documents from HyTime [ISO, 1997] to MPEG-7 [ISO, 2001] 1. Problems 2. Document engineering vs. KR 3. Architecture proposal 4. Experimentations 5. Conclusion and future work 2.1. Document engineering 2.2. Knowledge representation

23/04/2004 CWI Talk - Raphaël Troncy8 MPEG-7, the new multimedia description language? ISO standard since December of 2001 Main components: –Descriptors (Ds) and Description Schemes (DSs) –DDL (XML Schema + extensions) Concern all types of media Part 5 - MDS 2. Document engineering vs. KR 2.1. Document engineering 2.2. Knowledge representation

23/04/2004 CWI Talk - Raphaël Troncy9 Structure and semantics Structure Base unit: segment - temporal bounds or mask Possible decomposition 2. Document engineering vs. KR 2.1. Document engineering 2.2. Knowledge representation

23/04/2004 CWI Talk - Raphaël Troncy10 Semantics –entity –attribute –relation Classification Schemes (CS) –thesauric relationships Structure and semantics 2. Document engineering vs. KR 2.1. Document engineering 2.2. Knowledge representation

23/04/2004 CWI Talk - Raphaël Troncy11 MPEG-7 = a rich set of descriptors, but too restrictive to cover all the possible descriptions MPEG-7 extension with XML Schema: –Example: TV Anytime, Mdéfi [Tran Thuong, 2003] –Problem: add structure without semantics MPEG-7 extension with CS : –Example: the COALA system [Fatemi, 2003] –Problem: very poor expressivity Free annotation, knowledge-oriented –Strates-IA [Prié, 1999]: no control of the structure –E-SIA [Egyed-Zs, 2003]: knowledge base lost  MPEG-7+XML Schema are not enough! … but KR brings new solutions Other models 2. Document engineering vs. KR 2.1. Document engineering 2.2. Knowledge representation

23/04/2004 CWI Talk - Raphaël Troncy12 The formal specification of a conceptual model for a given domain –A set of concepts, of relations and axioms –Knowledge representation languages Methodologies of construction: –Adaptation of well-known software engineering guidelines: Methontology [Gomez-Perez] –Terminological acquisition: [Bachimont], [Aussenac Gilles] –Ontology cleaning with formal properties: [Guarino] Tools : –Protégé, WebODE, OilEd, OntoEdit, Terminae, DOE Ontologies in KR 2. Document engineering vs. KR 2.1. Document engineering 2.2. Knowledge representation

23/04/2004 CWI Talk - Raphaël Troncy13 RDF : [W3C, 1999 & W3C, 2004] –a data model for annotating Web resources –triples: resource → property → value RDFS : [W3C, 2004] –definition of the vocabulary OWL : [W3C, 2004] –hierarchy of classes and relations –axioms: algebraic properties, concept definitions, set operators, cardinalities KR languages for the Web (:"Stade 2" rdf:type ina:SportsNews) (:"Stade 2" ina:broadChannel "France2") (:"Stade 2" ina:broadDate ) 2. Document engineering vs. KR 2.1. Document engineering 2.2. Knowledge representation

23/04/2004 CWI Talk - Raphaël Troncy14 Definition of concepts and relations StudioProgram  and ( HomogeneousProgram (all hasPart StudioSequence) ) Definition of axioms HomogeneousProgram  HeterogeneousProgram =  Inferences if ONPP isA StudioProg then  seq  ONPP, seq isA StudioSeq Use of OWL+RDF for describing AV documents  Problem: how to control the structure of the descriptions ? 2. Document engineering vs. KR 2.1. Document engineering 2.2. Knowledge representation

23/04/2004 CWI Talk - Raphaël Troncy15 Our proposition Use jointly both approaches for representing the descriptions –the markup languages for describing and controlling the structure of each program –the ontology and the KR languages for describing formally the semantics of this structure and the content Automatize as much as possible the translation between these two representations Develop an architecture for reasoning on descriptions of video documents 1. Problems 2. Document engineering vs. KR 3. Architecture proposal 4. Experimentations 5. Conclusion and future work 3.1. AV ontology 3.2. Description schemes 3.3. Valid description 3.4. KB population

23/04/2004 CWI Talk - Raphaël Troncy16 General architecture 3. Architecture proposal 3.1. AV ontology 3.2. Description schemes 3.3. Valid description 3.4. KB population

23/04/2004 CWI Talk - Raphaël Troncy17 The Audio-visual Ontology Methodology of construction: ARCHONTE [Bachimont] –Conceptualization : differential principles –Formalization : formal definitions, axioms –Operationalization : export into a KR language AV domain: –Production objects (program, sequence, AV genre), Properties (theme), Persons, Technical Process (shooting, recording, post- production), Signal descriptors (audio, video), etc. Tools: –Conceptualization : DOE [Troncy & Isaac, IC’02] –Formalization : OilEd [Bechhofer, KI’01] –Languages : OWL Ontologies available on the Web: 3. Architecture proposal 3.1. AV ontology 3.2. Description schemes 3.3. Valid description 3.4. KB population

23/04/2004 CWI Talk - Raphaël Troncy18 The DOE ontology editor 3. Architecture proposal 3.1. AV ontology 3.2. Description schemes 3.3. Valid description 3.4. KB population

23/04/2004 CWI Talk - Raphaël Troncy19 Based on well-established professional practices Ontology export into the OWL language Results: –Construction time: 4 weeks –Ontology size quite important: 400 concepts OWL Formalization 3. Architecture proposal 3.1. AV ontology 3.2. Description schemes 3.3. Valid description 3.4. KB population

23/04/2004 CWI Talk - Raphaël Troncy20 General architecture 3. Architecture proposal 3.1. AV ontology 3.2. Description schemes 3.3. Valid description 3.4. KB population

23/04/2004 CWI Talk - Raphaël Troncy21 Generate XML Schema types OWL Class Sub-class Restriction on properties Union of classes XML Schema Complex type Extension Element of the content model Choice in the content model transformation Some concepts (program, sequence) refer to categories of audio-visual segments 3. Architecture proposal 3.1. AV ontology 3.2. Description schemes 3.3. Valid description 3.4. KB population

23/04/2004 CWI Talk - Raphaël Troncy22 Generic MPEG-7 extension Link these types to the existing MPEG-7 types 3. Architecture proposal 3.1. AV ontology 3.2. Description schemes 3.3. Valid description 3.4. KB population

23/04/2004 CWI Talk - Raphaël Troncy23 Build description schemes Let us watch some sports magazines –construction of a simple schema based on StudioSequence, Report and Interview –a Report contains some Excerpts of Broadcast Live Sports The schema provides the description skeleton for several sports magazine: –Téléfoot (soccer) –VéloClub (cycling) –3 Partout (multisports) 3. Architecture proposal 3.1. AV ontology 3.2. Description schemes 3.3. Valid description 3.4. KB population

23/04/2004 CWI Talk - Raphaël Troncy24 General architecture 3. Architecture proposal 3.1. AV Ontology 3.2. Description schemes 3.3. Valid description 3.4. KB population

23/04/2004 CWI Talk - Raphaël Troncy25 SegmenTool [French projet CHAPERON] 3. Architecture proposal 3.1. AV Ontology 3.2. Description schemes 3.3. Valid description 3.4. KB population

23/04/2004 CWI Talk - Raphaël Troncy T00:24:19 PT00H00M07S... Instantiate a document content model KB RDF triples 3. Architecture proposal 3.1. AV Ontology 3.2. Description schemes 3.3. Valid description 3.4. KB population

23/04/2004 CWI Talk - Raphaël Troncy27 General architecture 3. Architecture proposal 3.1. AV ontology 3.2. Description schemes 3.3. Valid description 3.4. KB population

23/04/2004 CWI Talk - Raphaël Troncy28 Methodology of construction: –Terminological acquisition Textual corpus of words [LeRoux, 2003] Tool for candidate term extraction: Lexter –Conceptualization and formalization DOE + OilEd Results: –Construction time: 3 weeks conceptualization, upper level, formalization –Ontology size: average 97 concepts, 61 relations The Cycling Ontology 3. Architecture proposal 3.1. AV ontology 3.2. Description schemes 3.3. Valid description 3.4. KB population

23/04/2004 CWI Talk - Raphaël Troncy29 The Cycling Ontology 3. Architecture proposal 3.1. AV ontology 3.2. Description schemes 3.3. Valid description 3.4. KB population

23/04/2004 CWI Talk - Raphaël Troncy30 S EIGO [Le Roux, 2003] Knowledge Base population Cycling domain + Base of facts 3. Architecture proposal 3.1. AV ontology 3.2. Description schemes 3.3. Valid description 3.4. KB population text

23/04/2004 CWI Talk - Raphaël Troncy31 General architecture 1. Problems 2. Document engineering vs. KR 3. Architecture proposal 4. Experimentations 5. Conclusion and future work

23/04/2004 CWI Talk - Raphaël Troncy32 1.First experimentation –Sesame : architecture for the storage of RDF triples [Broekstra, 2002] Supports different query languages: RQL, RDQL and SeRQL Implements the RDF Schema semantics (RDF-MT engine) –BOR : reasoner for the DAML+OIL language [Simov & Jordanov, 2002] –SeBOR : integration of the two systems, done in the On-To-Knowledge EU-IST Project 2.Second experimentation –Racer : OWL DL reasoner [Haarslev & Möller, 2001] –Rice : visualization interface [Möller et al., 2003] Experimentations 1. Problems 2. Document engineering vs. KR 3. Architecture proposal 4. Experimentations 5. Conclusion and future work

23/04/2004 CWI Talk - Raphaël Troncy33 Conclusion General architecture for reasoning on descriptions of video documents: –Control of the structure: creation of document schemes –Formal representation of the semantics: AV ontology and domain-specific ontology –Based on standards languages (MPEG-7, OWL, RDF) and the use of transformations Implementation and experimentations –Generic extension of MPEG-7 –Modeling of 2 ontologies with DOE –Creation of a Knowledge Base of events related to cycling race and use of an adapted reasoner 1. Problems 2. Document engineering vs. KR 3. Architecture proposal 4. Experimentations 5. Conclusion and future work

23/04/2004 CWI Talk - Raphaël Troncy34 Future work Development integration –Better integration of the tools used Planned experimentations –Populate a database with annotated video documents and test the system with a real panel of users –Apply this architecture to another domain than the cycling one –Benchmark the contribution of the AV ontology in a huge AV library without modifying the descriptions Long-term objectives –The ideal AV description language is still a research program –The description could be linked with: a rhetorical analysis of the documents a semiotic analysis of the documents 1. Problems 2. Document engineering vs. KR 3. Architecture proposal 4. Experimentations 5. Conclusion and future work

23/04/2004 CWI Talk - Raphaël Troncy35 Questions? 1.ProblemsProblems 2.Document engineering vs. knowledge representationDocument engineering vs. knowledge representation 3.Our proposal: an architecture for reasoning on descriptions of video documentsOur proposal: an architecture for reasoning on descriptions of video documents 4.ExperimentationsExperimentations 5.Conclusion and future workConclusion and future work

23/04/2004 CWI Talk - Raphaël Troncy36 Advertising June 21-25: The Week of Digital Document La Rochelle - France Workshop on: (unfortunately in French) "Documentary Model for Audio-visual" Web Site: Deadline approaching … April 30

23/04/2004 CWI Talk - Raphaël Troncy37

23/04/2004 CWI Talk - Raphaël Troncy38

23/04/2004 CWI Talk - Raphaël Troncy39

23/04/2004 CWI Talk - Raphaël Troncy40

23/04/2004 CWI Talk - Raphaël Troncy41

23/04/2004 CWI Talk - Raphaël Troncy42