Institute of Informatics and Telecommunications – NCSR “Demokritos” Bootstrapping ontology evolution with multimedia information extraction C.D. Spyropoulos,

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
National Institute of Statistics, Geography and Informatics (INEGI) Implementation of SDMX in Mexico.
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Distributed search for complex heterogeneous media Werner Bailer, José-Manuel López-Cobo, Guillermo Álvaro, Georg Thallinger Search Computing Workshop.
© NCSR, Paris, December 5-6, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Enrich the lexicons for the 1 st domain based on partners remarks.
GLOCAL Event-based Retrieval of Networked Media NEM Concertation Meeting Brussels, Feb
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Advanced Topics COMP163: Database Management Systems University of the Pacific December 9, 2008.
IST NeOn-project.org The Semantic Web is growing… #SW Pages Lee, J., Goodwin, R. (2004) The Semantic.
Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.
MUSCLE WP9 E-Team Integration of structural and semantic models for multimedia metadata management Aims: (Semi-)automatic MM metadata specification process.
Smart Learning Services Based on Smart Cloud Computing
Cluj Napoca, 28 August IEEE International Conference on Intelligent Computer Communication and Processing Digital Libraries Workshop Towards.
1 Image Video & Multimedia Systems Laboratory Multimedia Knowledge Laboratory Informatics and Telematics Institute Exploitation of knowledge in video recordings.
On the Need to Bootstrap Ontology Learning with Extraction Grammar Learning Kassel, 22 July 2005 Georgios Paliouras Software & Knowledge Engineering Lab.
Carlos Lamsfus. ISWDS 2005 Galway, November 7th 2005 CENTRO DE TECNOLOGÍAS DE INTERACCIÓN VISUAL Y COMUNICACIONES VISUAL INTERACTION AND COMMUNICATIONS.
CS621 : Seminar-2008 DEEP WEB Shubhangi Agrawal ( )‏ Jayalekshmy S. Nair ( )‏
GEM/IRDR Social Vulnerability and Resilience Information System and Metadata Portal IRDR Scientific Board Meeting Chengdu 03/11/2012.
An approach to Intelligent Information Fusion in Sensor Saturated Urban Environments Charalampos Doulaverakis Centre for Research and Technology Hellas.
NATIONAL TECHNICAL UNIVERSITY OF ATHENS Image, Video And Multimedia Systems Laboratory
MMSEM background Dr Ioannis Pratikakis Institute of Informatics & Telecommunications NCSR “Demokritos”, Athens, Greece MMSEM – F2F meeting Amsterdam, 10.
Mining the Semantic Web: Requirements for Machine Learning Fabio Ciravegna, Sam Chapman Presented by Steve Hookway 10/20/05.
Provenance Metadata for Shared Product Model Databases Etiel Petrinja, Vlado Stankovski & Žiga Turk University of Ljubljana Faculty of Civil and Geodetic.
Visualization, analysis and mining of geo- spatial information in educational data sets using web-based tools Aniruddha Desai |Winter 2013 Presentation.
The Yellow Group Design Informatics (Regli, Stone, Kusiak, Leifer, Gupta, Chung, Fenves, Law, Kopena)
Information Systems & Semantic Web University of Koblenz ▪ Landau, Germany Semantic Web - Multimedia Annotation – Steffen Staab
EStream – Best Practice in the Use of Streaming Media © A. Knierzinger, C. Weigner Increasing the use of Streaming technology in school education in Europe.
Comp 20 - Training & Instructional Design Unit 6 - Assessment This material was developed by Columbia University, funded by the Department of Health and.
CROSSMARC Web Pages Collection: Crawling and Spidering Components Vangelis Karkaletsis Institute of Informatics & Telecommunications NCSR “Demokritos”
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Semantic Web services Interoperability for Geospatial decision.
FI-CORE Data Context Media Management Chapter Release 4.1 & Sprint Review.
IST DIVAS Presentation 1 Advanced search technologies for digital audio-visual content.
Informatics and Telematics Institute - CERTH 1 BOEMIE: Bootstrapping Ontology Evolution with Multimedia Information Extraction Vasileios Papastathis Centre.
Enabling Access to Sound Archives through Integration, Enrichment and Retrieval WP2 – Media Semantics and Ontologies.
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
McGraw-Hill/Irwin © 2008 The McGraw-Hill Companies, All Rights Reserved Chapter 7 Storing Organizational Information - Databases.
IST Programme - Key Action III Semantic Web Technologies in IST Key Action III (Multimedia Content and Tools) Hans-Georg Stork CEC DG INFSO/D5
Evaluating Semantic Metadata without the Presence of a Gold Standard Yuangui Lei, Andriy Nikolov, Victoria Uren, Enrico Motta Knowledge Media Institute,
Project Overview Vangelis Karkaletsis NCSR “Demokritos” Frascati, July 17, 2002 (IST )
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
EASAIER Enabling Access to Sound Archives through Integration, Enrichment and Retrieval Ying Ding.
Informatics and Telematics Institute Centre for Research and Technology Hellas ITI-CERTH Amsterdam, Multimedia Semantics XG, July 2006 Vasileios.
Institute of Informatics and Telecommunications – NCSR “Demokritos” 1 NCSR at INDIGO Vangelis Karkaletsis Kick-off Project Meeting Athens, 15 February.
© NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Use of PROTÉGÉ to generate ontology and lexicons for the 1 st domain.
Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #15 Secure Multimedia Data.
NCSR “Demokritos” Institute of Informatics & Telecommunications CROSSMARC CROSS-lingual Multi Agent Retail Comparison Costas Spyropoulos & Vangelis Karkaletsis.
Application Ontology Manager for Hydra IST Ján Hreňo Martin Sarnovský Peter Kostelník TU Košice.
MMDB-9 J. Teuhola Standardization: MPEG-7 “Multimedia Content Description Interface” Standard for describing multimedia content (metadata).
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
And the Watson Plugin for the NeOn Toolkit. IST NeOn-project.org The Semantic Web is growing… #SW Pages.
Virtual Information and Knowledge Environments Workshop on Knowledge Technologies within the 6th Framework Programme -- Luxembourg, May 2002 Dr.-Ing.
NeOn Components for Ontology Sharing and Reuse Mathieu d’Aquin (and the NeOn Consortium) KMi, the Open Univeristy, UK
Semantic (web) activity at Elsevier Marc Krellenstein VP, Search and Discovery Elsevier October 27, 2004
K-WfGrid: Grid Workflows with Knowledge Ladislav Hluchy II SAS, Slovakia.
Constructing A Yami Language Lexicon Database from Yami Archiving Projects Meng-Chien Yang(Providence University, Taiwan) D. Victoria Rau(National Chung.
Institute of Informatics & Telecommunications NCSR “Demokritos” Spidering Tool, Corpus collection Vangelis Karkaletsis, Kostas Stamatakis, Dimitra Farmakiotou.
WP1: Plan for the remainder (1) Ontology –Finalise ontology and lexicons for the 2 nd domain (RTV) Changes agreed in Heraklion –Improvement to existing.
NCSR “Demokritos” Institute of Informatics & Telecommunications CROSSMARC CROSS-lingual Multi Agent Retail Comparison WP3 Multilingual and Multimedia Fact.
Digital Image Annotation Tool. INTRODUCTION Incorporation of digital media types Unstructured digital data Portal for managing annotations and tracking.
WP5: Semantic Multimedia
Representation and Analysis of Multimedia Content: The BOEMIE Proposal
 Corpus Formation [CFT]  Web Pages Annotation [Web Annotator]  Web sites detection [NEACrawler]  Web pages collection [NEAC]  IE Remote.
Institute of Informatics & Telecommunications NCSR “Demokritos”
Semantic Visualization
YourDataStories: Transparency and Corruption Fighting through Data Interlinking and Visual Exploration Georgios Petasis1, Anna Triantafillou2, Eric Karstens3.
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
An ecosystem of contributions
Knowledge-based event recognition from salient regions of activity
Presentation transcript:

Institute of Informatics and Telecommunications – NCSR “Demokritos” Bootstrapping ontology evolution with multimedia information extraction C.D. Spyropoulos, G. Paliouras, V. Karkaletsis, D. Kosmopoulos, I. Pratikakis, S. Perantonis, B. Gatos

The facts  STRP, IST “Semantic-based Knowledge and Content Systems”  Start: March , End: February 28, 2009  Budget: Euro, Funding: Euro  Consortium –Inst. of Informatics & Telecommunications, NCSR “Demokritos” (SKEL & CIL), Greece (Coordinator) –Fraunhofer Institute for Media Communication (NetMedia), Germany –Dip. di Informatica e Comunicazione, University of Milano (ISLab), Italy –Inst. of Telematics and Informatics CERTH (IPL), Greece –Hamburg University of Technology (STS), Germany –Tele Atlas, The Netherlands  More than 30 people already active in the project  Project portal:

Objectives  Providing technology to represent and evolve domain-specific multimedia ontologies.  Moving from low-level, general-purpose, single- modality feature extraction towards semantic, multimedia analysis.  Robust and scalable ontology-driven multimedia content extraction through ontology evolution.

Approach  Driven by domain-specific multimedia ontologies, BOEMIE information extraction systems will be able to identify high-level semantic features in image, video, audio and text and fuse these features for optimal extraction.  The ontologies will be continuously populated and enriched using the extracted semantic content.  This is a bootstrapping process, since the enriched ontologies will in turn be used to drive the multimedia information extraction system.

The end user’s view  The user wants to see the marathon of the 2006 athletics world championship in Athens. She wants to retrieve images and video of participating athletes in previous marathons. –The system has extracted the participating athletes’ names from official Web sites. –It has also populated the marathon ontology with images and video of past events, relating them to the athletes through fusion with audio and text.

The end user’s view  The user also wants to select a good view of the event, by retrieving images and video, associated with landmarks of the city. –The system has identified landmarks in visual information about past marathons in Athens and has thus georeferenced the content. –Reasoning can associate the city landmarks with the event and the related content.

The service provider’s view EVOLVED ONTOLOGY INITIAL ONTOLOGY POPULATION & ENRICHMENT COORDINATION INTERMEDIATE ONTOLOGY ONTOLOGY EVOLUTION TOOLKIT LEARNING TOLS REASONING ENGINE MATCHING TOOLS ONTOLOGY MANAGEMENT TOOL ONTOLOGY INITIALIZATION AND CONTENT MANAGEMENT TOOL ONTOLOGY EVOLUTION EVENTS DATABASE MAPS DATABASE MAP ANNOTATION INTERFACE SEMANTICS EXTRACTION RESULTS OTHER ONTOLOGIES SEMANTICS EXTRACTION MULTIMEDIA CONTENT SEMANTICS EXTRACTION TOOLKIT TEXT EXTRACTION TOOLS AUDIO EXTRACTION TOOLS INFORMATION FUSION TOOLS VISUAL EXTRACTION TOOLS FROM VISUAL CONTENT FROM NON-VISUAL CONTENT FROM FUSED CONTENT Content Collection (crawlers, spiders, etc.)

The service provider’s view Customize and use the system: –Intialization: collecting, extending and merging ontologies for domains –Training: collecting a training data set, using it for the training of the semantics extraction and ontology evolution tools –Information gathering: continuous collection of content from various sources –Semantics extraction: applying the trained tools to the incoming stream of content –Ontology evolution: populating and enriching the ontologies using the results of the extraction task –Information positioning: linking the extracted data to the map data

Semantics extraction  No single modality is powerful enough to support robust and large-scale extraction.  Emphasis on fusion of multiple modalities, using reasoning and handling uncertainty.  Contribution to the state of the art in visual content analysis, due to its richness and the difficulty of extracting semantics.  Non-visual content will provide supportive evidence, to improve precision.

Multimedia semantic model  A multimedia ontology describes the structure of multimedia content and visual characteristics of content objects in terms of low-level features.  One or more domain ontologies, e.g. about athletics.  A geographic ontology, e.g. about landmarks.  An event ontology, e.g. about athletic events.  Potential contribution: –Uncertainty in concept descriptions. –Spatial and temporal relations.

Ontology evolution  Ontology population and enrichment, i.e., addition of concepts, relations, properties and instances.  Coordination of homogeneous ontologies (same domain) and heterogeneous ontologies (e.g. domain and multimedia ontologies).  Potential contribution: –Ontology population from multimedia content. –Combination of different types of reasoning for enrichment and coordination. –Matching, coordination and versioning of the integrated semantic model.

Open issues: semantics extraction  Annotating training data for image and video.  Segment-level and document-level annotation and tracking.  Modeling of modality-specific domain concepts.  Use of entities extracted by one modality in the analysis of another.  Synchronization of different modalities.  The role of the semantic model in fusion and in single-modality analysis.  Support for concept and relation discovery from visual content.  Scalability!

Open issues: semantic model  Do we need to go beyond description logics, e.g. cannot support temporal reasoning in event detection?  What type of uncertainty and how is it going to be incorporated?  Combination of ontologies and reasoning with specialized databases, e.g. geographic.  Identify “detectable” concepts for various modalities.

Open issues: ontology evolution  Combination of different types of reasoning in ontology learning.  Incremental reasoning services to support evolution.  Evaluation of ontology enrichment.  Combination of evidence (e.g. from instances, lexical, etc.) for matching.  Comparison of ontology versions.  Minimization of human involvement!

Open issues: system integration  Implementation of the bootstrapping process, integrating semantic extraction and ontology evolution, through the semantic model.  Crawling for content collection and content quality assessment.  Distributed storage and indexing.  Demonstration of added value for the end user!

BOEMIE workshop BOEMIE 2006 Workshop on Ontology Evolution and Multimedia Information Extraction October 6, 2006, Podebrady, Czech Republic in EKAW th International Conference on Knowledge Engineering and Knowledge Management