MUMIS Franciska de Jong & Thijs Westerveld University of Twente Multimedia Indexing and Searching.

Slides:



Advertisements
Similar presentations
Generation of Multimedia TV News Contents for WWW Hsin Chia Fu, Yeong Yuh Xu, and Cheng Lung Tseng Department of computer science, National Chiao-Tung.
Advertisements

D2 Conceration, Vilamoura, April 16th Video & Image Indexing and Retrieval in the Large Scale V T A L A S Progress Nozha Boujemaa, Scientific Coordinator.
Multilinguality & Semantic Search Eelco Mossel (University of Hamburg) Review Meeting, January 2008, Zürich.
Information Extraction from Spoken Language Dr Pierre Dumouchel Scientific Vice-President, CRIM Full Professor, ÉTS.
MUMIS User Group Workshop P. Wittenburg Max-Planck-Institut für Psycholinguistik Nijmegen.
Speech Recognition Part 3 Back end processing. Speech recognition simplified block diagram Speech Capture Speech Capture Feature Extraction Feature Extraction.
ROSIDS - R apid O pen S ource I ntelligence D eployment S ystem Mark P. Pfeiffer, SAIL LABS Technology AG August 7, 2006.
DL:Lesson 11 Multimedia Search Luca Dini
ACCESSIBLE TECHNOLOGIES FOR SPEECH MANAGEMENT “Making media accessible to all” ITU workshop – Geneva October 2013.
PHONEXIA Can I have it in writing?. Discuss and share your answers to the following questions: 1.When you have English lessons listening to spoken English,
ARCHIVE IMAGING SEARCHABLE VIA THE WEBPAC Marthie de Kock The Hong Kong Institute of Education 9 December 2002.
Mining the web to improve semantic-based multimedia search and digital libraries
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
Named Entity Recognition for Digitised Historical Texts by Claire Grover, Sharon Givon, Richard Tobin and Julian Ball (UK) presented by Thomas Packer 1.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials 2.
User Generated Content, Folksonomies and how they can be combined Max Arends Electronic Commerce Group VSEM – The Virtual 3D Social.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Using Information Extraction for Question Answering Done by Rani Qumsiyeh.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System Supervisor: Prof Michael Lyu Presented by: Lewis Ng,
1 CSC 594 Topics in AI – Applied Natural Language Processing Fall 2009/2010 Overview of NLP tasks (text pre-processing)
Enabling Access to Sound Archives through Integration, Enrichment and Retrieval WP3 – Retrieval systems.
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
Smart Learning Services Based on Smart Cloud Computing
Automatic Transcript Generation Helmer Strik A 2 RT Dept. of Language & Speech University of Nijmegen.
® Automatic Scoring of Children's Read-Aloud Text Passages and Word Lists Klaus Zechner, John Sabatini and Lei Chen Educational Testing Service.
What’s the difference between Tony Blair and Mother Theresa? (Human Language Technology for Preservation return on investment)
WP5.4 - Introduction  Knowledge Extraction from Complementary Sources  This activity is concerned with augmenting the semantic multimedia metadata basis.
GATE, a General Architecture for Text Engineering Hamish Cunningham Department.
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Institute of Informatics and Telecommunications – NCSR “Demokritos” Bootstrapping ontology evolution with multimedia information extraction C.D. Spyropoulos,
Project 1 Online multi-user video monitoring system.
Using the GATE Architecture for NE Recognition in the Football Domain Horacio Saggion, Hamish Cunningham, Diana Maynard, Yorick Wilks Department of Computer.
CIG Conference Norwich September 2006 AUTINDEX 1 AUTINDEX: Automatic Indexing and Classification of Texts Catherine Pease & Paul Schmidt IAI, Saarbrücken.
Max Planck Institute for Psycholinguistics Tool development report H. Brugman MPI Nijmegen.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
The PrestoSpace Project Valentin Tablan. 2 Sheffield NLP Group, January 24 th 2006 Project Mission The 20th Century was the first with an audiovisual.
A Novel Framework for Semantic Annotation and Personalized Retrieval of Sports Video IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 10, NO. 3, APRIL 2008.
AnswerBus Question Answering System Zhiping Zheng School of Information, University of Michigan HLT 2002.
Overview of the merger prototype. Overview Backgrounds: The MUMIS project Cross document annotation merging Alignment of parallel fragments Unification.
1 BILC SEMINAR 2009 Speech Recognition: Is It for Real? Tony Mirabito Defense Language Institute English Language Center (DLIELC) DLIELC.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal VideoConference Archives Indexing System.
Web-Assisted Annotation, Semantic Indexing and Search of Television and Radio News (proceedings page 255) Mike Dowman Valentin Tablan Hamish Cunningham.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials: Informedia.
[D2.5] Object model and metadata: Open issues Workgroups Kick-off meeting – 2 & 3 April 2009 Julie Verleyen.
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
MIND: An architecture for multimedia information retrieval in federated digital libraries Henrik Nottelmann University of Dortmund, Germany.
BioRAT: Extracting Biological Information from Full-length Papers David P.A. Corney, Bernard F. Buxton, William B. Langdon and David T. Jones Bioinformatics.
Improving the OER Experience: Enabling Rich Media Notebooks of OER Video and Audio Brandon Muramatsu Andrew McKinney
1 Applications of video-content analysis and retrieval IEEE Multimedia Magazine 2002 JUL-SEP Reporter: 林浩棟.
Translingual Information Management Stephan Busemann Language Technology Lab German Research Center for Artificial Intelligence.
MedKAT Medical Knowledge Analysis Tool December 2009.
NCSR “Demokritos” Institute of Informatics & Telecommunications CROSSMARC CROSS-lingual Multi Agent Retail Comparison Costas Spyropoulos & Vangelis Karkaletsis.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
© 2003 DFKI Language Technology Lab Language Technology Information Extraction Retrieving relevant concepts and structured relations in unrestricted free.
Introduction A field survey of Dutch language resources has been carried out within the framework of a project launched by the Dutch Language Union (Nederlandse.
Behrooz ChitsazLorrie Apple Johnson Microsoft ResearchU.S. Department of Energy.
Learning Technology Development. edgehill.ac.uk Online Submission Workshop edgehill.ac.uk How to create an assignment dropbox? Assignment Template Dates.
Genoa – May 23, 2006 LREC workshop From Media Crossing to Media Mining Franciska de Jong University of Twente/TNO ICT
Using Human Language Technology for Automatic Annotation and Indexing of Digital Library Content Kalina Bontcheva, Diana Maynard, Hamish Cunningham, Horacio.
Multimedia Semantic Analysis in the PrestoSpace Project Valentin Tablan, Hamish Cunningham, Cristian Ursu NLP Research Group University of Sheffield Regent.
Multi-Source Information Extraction Valentin Tablan University of Sheffield.
Visual Information Retrieval
Supervisor: Prof Michael Lyu Presented by: Lewis Ng, Philip Chan
User Requirements in the Cultural Heritage Domain
TITLE Authors Institution
Lecture 8 Information Retrieval Introduction
Content Augmentation for Mixed-Mode News Broadcasts Mike Dowman
Open Source SUMMA Platform
Presentation transcript:

MUMIS Franciska de Jong & Thijs Westerveld University of Twente Multimedia Indexing and Searching

OBJECTIVES Automatically indexing of video Data from different media sources (paper, radio, tv) Domain: soccer Digitise + ASR Extract significant events Merge annotations Store final annotations UI for searching

FACTS SHEET Title: MUMIS: Multimedia Indexing and Searching Environment Funding: EU Language Engineering Sector of TAP Duration: 30 months July 2000 – January 2003 Volume: 2.4 M Euro, 385 Person months Languages:Dutch, English, German (Swedish)

Consortium University of Twente (NL) Sheffield University (UK) University of Nijmegen (NL) DFKI LT-Lab (DE) Max Planck Institute for Psycholinguistics (DE) Esteam (SE) VDA (NL)

Offline Processing Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Speech Transcr ASR EN DE Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Free Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text IE Merged Annotated formal text NL Information Extraction Automatic Speech Recognition Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Formal Text Speech Signals Merging Annotations Formal Text Formal Text Formal Text Anno- tations Merging

DOMAIN MODELLING DATA: text, video, audio Location …... Defender a ’Defender’ is a … Player Annotations Multilingual IE Multilingual Search... Player:… Consequence:… Time :… Location:... Multilingual Lexicons ENTITY EVENT RELATION Time Date PersonScoreObject Defender Official Artifact Stopper Goal Player:… Cause:… Time:…... Player Foul

SPEECH RECOGNITION Large-vocabulary Speaker independent Phoneme-based Hidden Markov models acoustic model language model Emotionally coloured speech Domain language model Match specific vocabularies (player names)

INFORMATION EXTRACTION multilingual formal descriptions closed captions tickers newspapers ASR output (radio/TV comment)

IE DATA Formal text Schoten op doel 4 4 Schoten naast doel 6 7 Overtredingen Gele kaarten 1 1 Rode kaarten 0 1 Hoekschoppen 3 5 Buitenspel 4 1 Ticker 24 Scholes beats Jens Jeremies wonderfully, dragging the ball around and past the Bayern Munich man. He then finds Michael Owen on the right wing, but Owen's cross is poor. TV report Scholes Past Jeremies Owen Newspaper Owen header pushed onto the post Deisler brought the German supporters to their feet with a buccaneering run down the right. Moments later Dietmar Hamann managed the first shot on target but it was straight at David Seaman. Mehmet Scholl should have done better after getting goalside of Phil Neville inside the area from Jens Jeremies’ astute pass but he scuffed his shot.

He then finds Michael Owen on the right wing PASS player1 = Scholes player2 = Owen. He Scholes then finds Michael Owen on the right wing … He then finds VP Michael Owen on the right wing NP but Owen's cross NP 24 Scholes beats Jens Jeremies wonderfully, dragging Scholes beat Jens Jeremies wonderfull, drag NUM Scholes PROP beatVERB 3p sing Jens PROP Jeremies PROP wonderfullADV, PUNCT Scholes beats Jens Jeremies wonderfully, dragging the ball around and past the Bayern Munich man. He then finds Michael Owen on the right wing, but Owen's cross is poor. IE Techniques & resources Tokenisation Lemmatisation POS + morphology Named Entities Shallow parsing Co-reference resolution Template filling 24 time Scholes player beat Jens Jeremies player wonderfull, …

MERGING Fuse annotations and recover from errors and differences: Multiple annotations of the same event (possibly with different attributes, e.g. time). Wrong event descriptions because of information extraction errors. Merging multiple partial annotations, e.g. by solving unsolved references like “star player”. Description logic

ON-LINE TASKS Search for interesting events with formal questions (user interface in many languages) Indicate hits by thumbnails & let user select scene Play scene via the Internet & allow scrolling Give me all goals from Overmars shot with his head in 1. Half. Event=Goal; Scorer=Overmars; Cause=Head; Time<=45 PSV - Ajax 1995 Ned - Eng 1998 Ned - Ger 1998 Multilingual Search and Display

SUMMARY Multimedia and multilingual ASR on emotionally coloured speech IE on ASR output Merging different annotations Search archives and play video online ml