The PrestoSpace Project Valentin Tablan. 2 Sheffield NLP Group, January 24 th 2006 Project Mission The 20th Century was the first with an audiovisual.

Slides:



Advertisements
Similar presentations
Generation of Multimedia TV News Contents for WWW Hsin Chia Fu, Yeong Yuh Xu, and Cheng Lung Tseng Department of computer science, National Chiao-Tung.
Advertisements

1 ECHO - European Chronicles On-line Pasquale Savino I.E.I. - C.N.R. Via Alfieri, Pisa
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Visit the ccScan Website Scan, Import, and Automatically File documents to the Cloud SCAN, IMPORT, AND AUTOMATICALLY FILE DOCUMENTS TO SALESFORCE ® Introduction.
A Human-Centered Computing Framework to Enable Personalized News Video Recommendation (Oh Jun-hyuk)
Yansong Feng and Mirella Lapata
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Supporting New Business Imperatives Creating a Framework for Interoperable Media Services (FIMS)
Interoperability Scenarios All Working Groups Meeting May, Rome, Italy.
Bibliothèque nationale de France Tallinn, BnF update: production and development priorities in 2015.
1 Texmex – November 15 th, 2005 Strategy for the future Global goal “Understand” (= structure…) TV and other MM documents Prepare these documents for applications.
Mining the web to improve semantic-based multimedia search and digital libraries
Information Retrieval in Practice
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials 2.
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
1 CS 502: Computing Methods for Digital Libraries Lecture 20 Multimedia digital libraries.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang National Central University
Toward Semantic Web Information Extraction B. Popov, A. Kiryakov, D. Manov, A. Kirilov, D. Ognyanoff, M. Goranov Presenter: Yihong Ding.
Chapter 5: Information Retrieval and Web Search
Overview of Search Engines
1 Archive-It Training University of Maryland July 12, 2007.
1 LOMGen: A Learning Object Metadata Generator Applied to Computer Science Terminology A. Singh, H. Boley, V.C. Bhavsar National Research Council and University.
Analysing Crime-Scene Reports Katerina Pastra and Horacio Saggion University of Sheffield Scene of Crime Information System.
1 Samson Cheung EE 639, Fall 2004 Lecture 1: Applications & Trends Multimedia Information Systems advent: open communicator browser, screen cam, hari’s.
Creating Access to Europe’s Television Heritage Prof. Dr. Sonja de Leeuw (project-coordinator, Utrecht University) Johan Oomen MA (technical director,
Lecture #32 WWW Search. Review: Data Organization Kinds of things to organize –Menu items –Text –Images –Sound –Videos –Records (I.e. a person ’ s name,
1 Wikification CSE 6339 (Section 002) Abhijit Tendulkar.
1 CS 430 / INFO 430 Information Retrieval Lecture 23 Non-Textual Materials 2.
Information Retrieval and Web Search Lecture 1. Course overview Instructor: Rada Mihalcea Class web page:
Chapter 2 Architecture of a Search Engine. Search Engine Architecture n A software architecture consists of software components, the interfaces provided.
WISER Social Sciences: Politics & International Relations Gillian Beattie (Social Science Library) Jane Rawson (Vere Harmsworth Library)
1 Technologies for (semi-) automatic metadata creation Diana Maynard.
Semantics and Syntax of Dublin Core Usage in Open Archives Initiative Data Providers of Cultural Heritage Materials Arwen Hutt, University of Tennessee.
1 Bridging the gap between the paper past and digital future.
Web-Assisted Annotation, Semantic Indexing and Search of Television and Radio News (proceedings page 255) Mike Dowman Valentin Tablan Hamish Cunningham.
Chapter 6: Information Retrieval and Web Search
Search. Search issues How do we say what we want? –I want a story about pigs –I want a picture of a rooster –How many televisions were sold in Vietnam.
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
Webarchivering in het Audiovisuele Domein Web archiving in the audiovisual Domain Julia Vytopil- Nederlands Instituut voor Beeld en Geluid Netherlands.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials: Informedia.
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
1 Applications of video-content analysis and retrieval IEEE Multimedia Magazine 2002 JUL-SEP Reporter: 林浩棟.
1 Language Technologies (2) Valentin Tablan University of Sheffield, UK ACAI 05 ADVANCED COURSE ON KNOWLEDGE DISCOVERY.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
MPEG-7 Audio Overview Ichiro Fujinaga MUMT 611 McGill University.
1 CS 430 / INFO 430 Information Retrieval Lecture 17 Metadata 4.
Behrooz ChitsazLorrie Apple Johnson Microsoft ResearchU.S. Department of Energy.
1 ICASSP Paper Survey Presenter: Chen Yi-Ting. 2 Improved Spoken Document Retrieval With Dynamic Key Term Lexicon and Probabilistic Latent Semantic Analysis.
VIDEO ACTIVE Creating Access to European Television History Project Update FIAT World Conference, Lisbon October 15th, 2007 Alexander Hecht (ORF, A) –
Video Active and the European Digital Library EDL International Conference Frankfurt am Main, 31/1-1/ Sonja de Leeuw.
General Architecture of Retrieval Systems 1Adrienn Skrop.
LREC – Workshop on Crossing media for Improved Information Access, Genova, Italy, 23 May Cross-Media Indexing in the Reveal-This System Murat Yakici,
Using Human Language Technology for Automatic Annotation and Indexing of Digital Library Content Kalina Bontcheva, Diana Maynard, Hamish Cunningham, Horacio.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
ESWC 2005, Crete, Greece Semantically Enhanced Television News through Web and Video Integration Multimedia and the Semantic Web workshop Borislav PopovMike.
Multimedia Semantic Analysis in the PrestoSpace Project Valentin Tablan, Hamish Cunningham, Cristian Ursu NLP Research Group University of Sheffield Regent.
Multi-Source Information Extraction Valentin Tablan University of Sheffield.
University of Sheffield, NLP Introduction to Text Mining Module 4: Applications (Part 2)
Information Retrieval in Practice
Digital Video Library - Jacky Ma.
Visual Information Retrieval
Search Engine Architecture
Multimedia Information Retrieval
Searching and browsing through fragments of TED Talks
Content Augmentation for Mixed-Mode News Broadcasts Mike Dowman
Presentation transcript:

The PrestoSpace Project Valentin Tablan

2 Sheffield NLP Group, January 24 th 2006 Project Mission The 20th Century was the first with an audiovisual record. Audiovisual media became the new form of cultural expression. These historical, cultural and commercial assets are now entirely at risk from deterioration. PrestoSpace aims to provide technical devices and systems for digital preservation of all types of audio-visual collections.

3 Sheffield NLP Group, January 24 th 2006 The Partners 34 partners (32 existing + 2 to be found) Steering Board: –Institut National de l’Audiovisuel INA (France) –British Broadcasting Corporation BBC (UK) –Radiotelevisione Italiana RAI (Italy) –Joanneum Research JRS (Austria) –Netherlands Institute for Sound and Vision - Beeld en Geluid B&G (The Netherlands) –Oesterreichischer Rundfunk ORF (Austria) –University of Sheffield USFD (UK)

4 Sheffield NLP Group, January 24 th 2006 Project Organisation

5 Sheffield NLP Group, January 24 th 2006 USFD’s Involvement Member of MAD work area –Leader of the Semantic Analysis taskforce Member of the Project Steering Board Member of the Project Technical Board

6 Sheffield NLP Group, January 24 th 2006 Motivation Broadcasters produce many of hours of material daily (BBC has 8 TV and 11 radio national channels) Some of this material can be reused in new productions Access to archive material can be provided by some form of semantic annotation and indexing Manual annotation is time consuming (up to 10x real time) and expensive Currently some 90% of BBC’s output is only annotated at a very basic level

7 Sheffield NLP Group, January 24 th 2006 RichNews A prototype addressing the automation of semantic annotation for multimedia material Not aiming at reaching performance comparable to that of human annotators Fully automatic Aimed at news material, further extensions possible TV and radio news broadcasts from the BBC were used during development and testing

8 Sheffield NLP Group, January 24 th 2006 Overview Input: multimedia file Output: OWL/RDF descriptions of content –Headline (short summary) –List of entities (Person/Location/Organization/…) –Related web pages –Segmentation Multi-source Information Extraction system –Automatic speech transcript –Subtitles/closed captions –Related web pages –Legacy metadata

9 Sheffield NLP Group, January 24 th 2006 Key Problems Obtaining a transcript: Speech recognition produces poor quality transcripts with many mistakes (error rate ranging from 10 to 90%) More reliable sources (subtitles/closed captions) not always available Broadcast segmentation: A news broadcast contains several stories. How do we work out where one starts and another one stops?

10 Sheffield NLP Group, January 24 th 2006 Architecture THISL Speech Recogniser C99 Topical Segmenter TF.IDF Key Phrase Extraction Media File Manual Annotation (Optional) Entity Validation Semantic Index Web-Search and Document Matching KIM Information Extraction Degraded Text Information Extraction

11 Sheffield NLP Group, January 24 th 2006 Using ASR Transcripts ASR is performed by the THISL system. Based on ABBOT connectionist speech recognizer. Optimized specifically for use on BBC news broadcasts. Average word error rate of 29%. Error rate of up to 90% for out of studio recordings.

12 Sheffield NLP Group, January 24 th 2006 Topical Segmentation Uses C99 segmenter: Removes common words from the ASR transcripts. Stems the other words to get their roots. Then looks to see in which parts of the transcripts the same words tend to occur.  These parts will probably report the same story.

13 Sheffield NLP Group, January 24 th 2006 Key Phrase Extraction Term frequency inverse document frequency (TF.IDF): Chooses sequences of words that tend to occur more frequently in the story than they do in the language as a whole. Any sequence of up to three words can be a phrase. Up to four phrases extracted per story.

14 Sheffield NLP Group, January 24 th 2006 Web Search and Document Matching The Key-phrases are used to search on the BBC, and the Times, Guardian and Telegraph newspaper websites for web pages reporting each story in the broadcast. Searches are restricted to the day of broadcast, or the day after. Searches are repeated using different combinations of the extracted key-phrases. The text of the returned web pages is compared to the text of the transcript to find matching stories.

15 Sheffield NLP Group, January 24 th 2006 Using the Web Pages The web pages contain: A headline, summary and section for each story. Good quality text that is readable, and contains correctly spelt proper names. They give more in depth coverage of the stories.

16 Sheffield NLP Group, January 24 th 2006 Semantic Annotation The KIM knowledge management system can semantically annotate the text derived from the web pages: KIM will identify people, organizations, locations etc. KIM performs well on the web page text, but very poorly when run on the transcripts directly. This allows for semantic ontology-aided searches for stories about particular people or locations etcetera. So we could search for people called Sydney, which would be difficult with a text-based search.

17 Sheffield NLP Group, January 24 th 2006 Entity Matching

18 Sheffield NLP Group, January 24 th 2006 Search for Entities

19 Sheffield NLP Group, January 24 th 2006 Story Retrieval

20 Sheffield NLP Group, January 24 th 2006 Evaluation Success in finding matching web pages was investigated. Evaluation based on 66 news stories from 9 half-hour news broadcasts. Web pages were found for 40% of stories. 7% of pages reported a closely related story, instead of that in the broadcast. Results are based on earlier version of the system, only using BBC web pages.

21 Sheffield NLP Group, January 24 th 2006 Future Improvements Use teletext subtitles (closed captions) when they are available Better story segmentation through visual cues. Use for different domains and languages.

22 Sheffield NLP Group, January 24 th 2006 Demo

23 Sheffield NLP Group, January 24 th 2006 More Information