LREC 2004, 26 May 2004, Lisbon 1 Multimodal Multilingual Resources in the Subtitling Process S.Piperidis, I.Demiros, P.Prokopidis, P.Vanroose, A. Hoethker,

Slides:



Advertisements
Similar presentations
School of something FACULTY OF OTHER School of Computing FACULTY OF ENGINEERING Chunking: Shallow Parsing Eric Atwell, Language Research Group.
Advertisements

The Application of Machine Translation in CADAL Huang Chen, Chen Haiying Zhejiang University Libraries, Hangzhou, China
The Chinese Room: Understanding and Correcting Machine Translation This work has been supported by NSF Grants IIS Solution: The Chinese Room Conclusions.
A Visual Programming Environment Based on Graph Grammars and Tidy Graph Drawing Y. Adachi, Y. Miyadera, K. Sugita K. Tsuchida and T. Yaku.
Markpong Jongtaveesataporn † Chai Wutiwiwatchai ‡ Koji Iwano † Sadaoki Furui † † Tokyo Institute of Technology, Japan ‡ NECTEC, Thailand.
Year 3 Objectives: Writing
A Syntactic Translation Memory Vincent Vandeghinste Centre for Computational Linguistics K.U.Leuven
Improving Machine Translation Quality via Hybrid Systems and Refined Evaluation Methods Andreas Eisele DFKI GmbH and Saarland University Helsinki, November.
Facial expression as an input annotation modality for affective speech-to-speech translation Éva Székely, Zeeshan Ahmed, Ingmar Steiner, Julie Carson-Berndsen.
CS4025: Advanced Information Extraction. Overview CS4025, Department of Computing Science, University of Aberdeen 2 Overview of aspects of IE and General.
J. Kunzmann, K. Choukri, E. Janke, A. Kießling, K. Knill, L. Lamel, T. Schultz, and S. Yamamoto Automatic Speech Recognition and Understanding ASRU, December.
Development of Automatic Speech Recognition and Synthesis Technologies to Support Chinese Learners of English: The CUHK Experience Helen Meng, Wai-Kit.
PHONEXIA Can I have it in writing?. Discuss and share your answers to the following questions: 1.When you have English lessons listening to spoken English,
Languages & The Media, 5 Nov 2004, Berlin 1 New Markets, New Trends The technology side Stelios Piperidis
Languages & The Media, 4 Nov 2004, Berlin 1 Multimodal multilingual information processing for automatic subtitle generation: Resources, Methods and System.
Predicting Text Quality for Scientific Articles Annie Louis University of Pennsylvania Advisor: Ani Nenkova.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials 2.
Combining Prosodic and Text Features for Segmentation of Mandarin Broadcast News Gina-Anne Levow University of Chicago SIGHAN July 25, 2004.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Machine Translation Prof. Alexandros Potamianos Dept. of Electrical & Computer Engineering Technical University of Crete, Greece May 2003.
Resources Primary resources – Lexicons, structured vocabularies – Grammars (in widest sense) – Corpora – Treebanks Secondary resources – Designed for a.
Evaluating an MT French / English System Widad Mustafa El Hadi Ismaïl Timimi Université de Lille III Marianne Dabbadie LexiQuest - Paris.
Semi-Automatic Learning of Transfer Rules for Machine Translation of Low-Density Languages Katharina Probst April 5, 2002.
Visual Speech Recognition Using Hidden Markov Models Kofi A. Boakye CS280 Course Project.
Does Syntactic Knowledge help English- Hindi SMT ? Avinesh. PVS. K. Taraka Rama, Karthik Gali.
TectoMT two goals of TectoMT –to allow experimenting with MT based on deep- syntactic (tectogrammatical) transfer –to create a software framework into.
Information Retrieval in Practice
AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
LDMT MURI Data Collection and Linguistic Annotations November 4, 2011 Jason Baldridge, UT Austin Ulf Hermjakob, USC/ISI.
Computational Methods to Vocalize Arabic Texts H. Safadi*, O. Al Dakkak** & N. Ghneim**
Data collection and experimentation. Why should we talk about data collection? It is a central part of most, if not all, aspects of current speech technology.
English-Persian SMT Reza Saeedi 1 WTLAB Wednesday, May 25, 2011.
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
Real-Time Speech Recognition Subtitling in Education Respeaking 2009 Dr Mike Wald University of Southampton.
Movie scene corpus for language learning Eiichi Yubune (Toyo University), Ryuji Tabuchi (Mint Applications), Akinobu Kanda (Tokyo Metropolitan University),
Profile The METIS Approach Future Work Evaluation METIS II Architecture METIS II, the continuation of the successful assessment project METIS I, is an.
Experiments on Building Language Resources for Multi-Modal Dialogue Systems Goals identification of a methodology for adapting linguistic resources for.
Can Controlled Language Rules increase the value of MT? Fred Hollowood & Johann Rotourier Symantec Dublin.
CROSSMARC Web Pages Collection: Crawling and Spidering Components Vangelis Karkaletsis Institute of Informatics & Telecommunications NCSR “Demokritos”
Digital Information and Heritage INFuture Zagreb, Sentence Alignment as the Basis For Translation Memory Database Sanja Seljan Faculty of.
Machine Translation  Machine translation is of one of the earliest uses of AI  Two approaches:  Traditional approach using grammars, rewrite rules,
Rundkast at LREC 2008, Marrakech LREC 2008 Ingunn Amdal, Ole Morten Strand, Jørn Almberg, and Torbjørn Svendsen RUNDKAST: An Annotated.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials: Informedia.
Nespole!’s Experiment on Multimodality (Summer 2001) Erica Costantini (University of Trieste) Fabio Pianesi (ITC-irst, Trento) Susanne Burger (CMU)
Prof. Thomas Sikora Technische Universität Berlin Communication Systems Group Thursday, 2 April 2009 Integration Activities in “Tools for Tag Generation“
Using Surface Syntactic Parser & Deviation from Randomness Jean-Pierre Chevallet IPAL I2R Gilles Sérasset CLIPS IMAG.
Artificial Intelligence: Natural Language
From Text to Image: Generating Visual Query for Image Retrieval Wen-Cheng Lin, Yih-Chen Chang and Hsin-Hsi Chen Department of Computer Science and Information.
Voice Activity Detection based on OptimallyWeighted Combination of Multiple Features Yusuke Kida and Tatsuya Kawahara School of Informatics, Kyoto University,
Rapid Development in new languages Limited training data (6hrs) provided by NECTEC from 34 speakers, + 8 spks for development and test Romanization of.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Introduction A field survey of Dutch language resources has been carried out within the framework of a project launched by the Dutch Language Union (Nederlandse.
Text segmentation Amany AlKhayat. Before any real processing is done, text needs to be segmented at least into linguistic units such as words, punctuation,
8 December 1997Industry Day Applications of SuperTagging Raman Chandrasekar.
Combining Text and Image Queries at ImageCLEF2005: A Corpus-Based Relevance-Feedback Approach Yih-Cheng Chang Department of Computer Science and Information.
Error Analysis of Two Types of Grammar for the purpose of Automatic Rule Refinement Ariadna Font Llitjós, Katharina Probst, Jaime Carbonell Language Technologies.
Natural Language Processing Group Computer Sc. & Engg. Department JADAVPUR UNIVERSITY KOLKATA – , INDIA. Professor Sivaji Bandyopadhyay
1 ICASSP Paper Survey Presenter: Chen Yi-Ting. 2 Improved Spoken Document Retrieval With Dynamic Key Term Lexicon and Probabilistic Latent Semantic Analysis.
Phone-Level Pronunciation Scoring and Assessment for Interactive Language Learning Speech Communication, 2000 Authors: S. M. Witt, S. J. Young Presenter:
AQUAINT Mid-Year PI Meeting – June 2002 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
ELAN as a tool for oral history CLARIN Oral History Workshop Oxford Sebastian Drude CLARIN ERIC 18 April 2016.
LREC – Workshop on Crossing media for Improved Information Access, Genova, Italy, 23 May Cross-Media Indexing in the Reveal-This System Murat Yakici,
Centre for Translation Studies FACULTY OF ARTS
Digital Video Library - Jacky Ma.
Year 3 Objectives: Writing
INAGO Project Automatic Knowledge Base Generation from Text for Interactive Question Answering.
Peggy van der Kreeft Deutsche Welle
The art of subtitling within the European institutions
Content Augmentation for Mixed-Mode News Broadcasts Mike Dowman
Open Source SUMMA Platform
Presentation transcript:

LREC 2004, 26 May 2004, Lisbon 1 Multimodal Multilingual Resources in the Subtitling Process S.Piperidis, I.Demiros, P.Prokopidis, P.Vanroose, A. Hoethker, W. Daelemans, E. Sklavounou, M. Konstantinou, Y. Karavidas {spip, iason, {walter.daelemans,

LREC 2004, 26 May 2004, Lisbon 2 Subtitling Subtitling aims at enhancing a TV (or similar) programme with a written version of its narrative or dialogue The written version can be in the : –same language as the spoken – monolingual subtitling –different language from the spoken - multilingual subtitling

LREC 2004, 26 May 2004, Lisbon 3 Challenges of Subtitling the challenge (in automated generation) is that there must be agreement between subtitles, the spoken source language and the corresponding image generated subtitles must meet a set of constraints imposed by the visual context of the text and spatio-temporal factors subtitle text is no longer normal written text but rather oral text

LREC 2004, 26 May 2004, Lisbon 4 Experiments in MUSA experiments on monolingual and multilingual subtitle generation Languages : English : source & target French & Greek : target Technologies used –English ASR component for the transcription of audio streams into text –Subtitling component producing English subtitles from English audio transcriptions –Translation component integrating Machine Translation and Translation Memory, for EN-FR & EN-EL

LREC 2004, 26 May 2004, Lisbon 5 Architecture

LREC 2004, 26 May 2004, Lisbon 6 Resources for subtitling in order to train and evaluate system components, an array of application specific resources is necessary primary audiovisual data from BBC World Service, documentaries and “newsy” current affairs programmes for each programme, the following parallel data are sourced the actual video of the programme its script or hand-made transcript English, Greek and French subtitles topically relevant newspaper and web-sourced extracts

LREC 2004, 26 May 2004, Lisbon 7 Resources overview ScriptsTran scripts Scripts +Tran scripts EN sub titles EL sub titles FR sub titles Horizon Panora ma Misc DVDs Totals

LREC 2004, 26 May 2004, Lisbon 8 Speech recognition component Use of parallel corpus of BBC programs, audio and hand-made transcripts, as well as topically relevant newspaper texts Tuning of acoustic and language models of the KUL/ESAT recogniser Background noise & non-native speech hinder the process Alignment of audio with hand-made transcripts proved to be a working solution helping overcome noise and non-native speakers problems

LREC 2004, 26 May 2004, Lisbon 9 Speech recognition component (2)

LREC 2004, 26 May 2004, Lisbon 10 Constraints & Requirements subtitling conventions in various EU countries constraints entail that compression of transcripts’ segments is required compression rate expressed in # of words and # of chars to delete

LREC 2004, 26 May 2004, Lisbon 11 Subtitling engine & resources Use of a parallel corpus of BBC programs featuring program hand-validated transcripts and their hand-made subtitles Align sentences and words in the parallel corpus Extract a table of paraphrases to compress Example –Within the next few years -> Soon –During the years when -> While –It was clear that -> Clearly

LREC 2004, 26 May 2004, Lisbon 12 Subtitling engine & resources (2) If compression rate is not reached by using paraphrasing, apply syntactic rules to delete low-importance units (e.g. adverbs, adjectives, etc) Hand-crafted deletion rules making use of –A shallow-parse of the segments –Surprise values for each word, computed on the basis of a large text corpus. If more deletable segments than necessary exist, start by deleting the least important segments first.

LREC 2004, 26 May 2004, Lisbon 13 Translation component integrate TM (TrAID) and MT (Systran) align EN hand-made subtitles with FR and EL hand-made subtitles build a translation memory database (high % of unique translation units, not unexpected) perform term extraction on the parallel corpus hand-validate automatically extracted terms and use them for translation customisation purposes

LREC 2004, 26 May 2004, Lisbon 14 Subtitle editing responsible for textual operations, tokenisation and subtitle text splitting, calculation of cue-in/cue-out timecodes Requirement : Subtitled text should be segmented at the highest syntactic nodes possible Hand-crafted rules of type: “Cut after punctuation”, “Cut after personal pronouns following a verb phrase”, etc. For EN use of available shallow parse information For FR and EL, use of pos tagging information did not produce worse results

LREC 2004, 26 May 2004, Lisbon 15 Evaluation (in progress) so far, relatively poor ASR results for subtitling purposes alignment mode of ASR yielded >90% accuracy grammaticality and acceptability of subtitles >80% BLEU protocol applied for subtitling evaluation evaluation of translation component and integrated prototype are ongoing

LREC 2004, 26 May 2004, Lisbon 16 Conclusions human subtitling is an extremely complex process a simplified computational model is feasible an architecture for a multilingual subtitling system has been implemented an interesting array of resources has been collected and processed at different levels, yielding useful derivative resources evaluation is even more challenging!