Arnar Thor Jensson, Koji Iwano, Sadaoki Furui (Tokyo Institute of Technology): Development of a Speech Recognition System For Icelandic Using Machine Translated Text

Presentation transcript:

Development of a Speech Recognition System For Icelandic Using Machine Translated Text
Arnar Thor Jensson, Koji Iwano, Sadaoki Furui
Tokyo Institute of Technology

Overview
Research introduction
Previous work
Improving language model with translated text
Experimental scenario
Data (AM, LM)
Results
Conclusion
Future work

Research Introduction
Icelandic weather information speech recognition system
A speech recogniser needs a large amount of data
  – Acoustic Model
  – Language Model
Text data is often very hard and expensive to obtain, e.g. spontaneous speech text

Resource Deficient Languages
How many languages are spoken today?
  – More than [...] languages, including dialects
Icelandic (population ??????)
  – Around [...] people live in Iceland
  – The Icelandic people are very proud of their language (it is probably the largest part of our culture)
  – We have been trying to save it from foreign influences, e.g. Computer -> Tölva

Resource Deficient Languages
How can resource deficient languages be helped?
[Diagram labels: resource rich languages (e.g. English, Japanese); resource deficient language]

Translation Methods
How can resource deficient languages be helped?
Using translated data may be useful!
  – Manual: often hard work
  – Machine translation
Sentence-by-sentence
  – A large parallel corpus or a rule based system is needed for MT
  – Previous research in this area: Nakajima, H., Yamamoto, H., Watanabe, T. “Language Model Adaptation with Additional Text Generated by Machine Translation”, Proc. COLING, 2002, vol. 2
Word-by-word
  – Only a dictionary is needed (often easy to obtain)

Previous Work Extended
  – This paper extends our previous paper [Jensson, 2005]
  – The dictionary used to translate word-by-word is now created automatically by a simple rule based machine translation system
  – This paper also introduces sentence-by-sentence machine translated texts from English to Icelandic
  – More data was used in the experiments
Evaluation: 2 hours instead of 6 minutes
Manually translated text was increased
Each experiment was performed three times with randomly chosen text to increase reliability

Translation Methods
Sentence-by-Sentence (SBS) machine translation can be applied to any language pair
  – Rule based machine translation
  – Statistical machine translation
Word-by-Word (WBW) translation is expected to be useful for closely grammatically related languages
  – English vs. French
  – English vs. Icelandic: SVO vs. SVO, works
  – English vs. Japanese: SVO vs. SOV, does not work

Translation Method (WBW)
Creation of a dictionary and how it is used to translate in our system
Note that large text data for the target domain often exist in other languages
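The word-by-word idea on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' system: the dictionary `TOY_DICTIONARY` is a hand-written toy (the slides say the real dictionary was created automatically by a rule based MT system), the function name is hypothetical, and passing unknown words through unchanged is an assumed policy.

```python
# Toy English-to-Icelandic dictionary. The entries are illustrative only and
# are NOT taken from the authors' automatically created dictionary.
TOY_DICTIONARY = {
    "rain": "rigning",
    "and": "og",
    "wind": "vindur",
    "snow": "snjór",
    "in": "í",
}


def translate_word_by_word(sentence, dictionary):
    """Replace each source word by its dictionary entry, keeping word order.

    Words missing from the dictionary are passed through unchanged
    (an assumption; the slides do not say how such words were handled).
    """
    words = sentence.lower().split()
    return " ".join(dictionary.get(word, word) for word in words)


if __name__ == "__main__":
    print(translate_word_by_word("Rain and wind in Reykjavik", TOY_DICTIONARY))
```

Because the word order of the source sentence is kept, a lookup like this can only be expected to give usable language model text when the two languages order their words similarly, which matches the SVO remark on the previous slide.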

Core structure
Language models from the translated corpus and the original text corpus are interpolated together, creating a new language model (LM3)
Interpolation formula: [...]
All language models were built using 3-grams with Kneser-Ney smoothing
[Diagram labels: Sparse Text (ST); Translated Rich Text (TRT)]
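The interpolation formula itself did not survive in this transcript. A common choice for combining two language models is linear interpolation, P_LM3(w | h) = λ · P_ST(w | h) + (1 − λ) · P_TRT(w | h); the sketch below assumes that form. The weight name `lam`, the function names, and the toy probabilities are all hypothetical, and treating a word missing from one model as probability 0.0 is a simplification that a smoothed 3-gram model would not need.

```python
def interpolate(p_st, p_trt, lam):
    """Linearly interpolate two probabilities for the same word and history.

    Assumed form: lam * P_ST + (1 - lam) * P_TRT, with lam in [0, 1].
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    return lam * p_st + (1.0 - lam) * p_trt


def interpolate_distributions(dist_st, dist_trt, lam):
    """Interpolate two next-word distributions that share the same history.

    A word absent from one distribution is given probability 0.0 there,
    which is a toy simplification.
    """
    vocabulary = set(dist_st) | set(dist_trt)
    return {
        word: interpolate(dist_st.get(word, 0.0), dist_trt.get(word, 0.0), lam)
        for word in vocabulary
    }


if __name__ == "__main__":
    # Hypothetical next-word probabilities from the two models.
    sparse_text_lm = {"rigning": 0.5, "snjór": 0.5}
    translated_text_lm = {"rigning": 0.25, "vindur": 0.75}
    lm3 = interpolate_distributions(sparse_text_lm, translated_text_lm, lam=0.5)
    for word in sorted(lm3):
        print(word, lm3[word])
```

If both inputs are proper distributions, the interpolated values also sum to one, so the result can be used directly as a language model probability.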

Experimental Scenario
English to Icelandic
Speech recognition experiments done with word-by-word and sentence-by-sentence (rule based) translated text in the weather information domain
The Jupiter system (a weather information system developed by MIT) was used as the English corpus
Rich language: English
Sparse language: Icelandic
Evaluation: speech recognition WER
[Table, values not preserved. Columns: Translation Method, BLEU 1-gram, BLEU 2-gram. Rows: WBW, SBS]
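The word error rate (WER) named on this slide is conventionally the word-level edit distance between the reference transcript and the recogniser output, divided by the number of reference words. The sketch below shows that standard definition; the function name and the example strings are hypothetical, and the slides do not say which scoring tool the authors used.

```python
def word_error_rate(reference, hypothesis):
    """Return (substitutions + deletions + insertions) / reference length.

    Computed as the word-level Levenshtein distance with dynamic programming.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # previous[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    previous = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            current.append(min(
                previous[j] + 1,         # deletion of a reference word
                current[j - 1] + 1,      # insertion of a hypothesis word
                previous[j - 1] + cost,  # substitution or exact match
            ))
        previous = current
    return previous[-1] / len(ref)


if __name__ == "__main__":
    print(word_error_rate("rigning og vindur í dag", "rigning og vindur"))
```

Note that WER can exceed 1.0 when the hypothesis contains many inserted words, since the denominator is the reference length only.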

Icelandic Acoustic Model
Training: read speech from a biphonetically balanced text corpus

Attribute              Acoustic Corpus
No. male speakers      13
No. female speakers    7
Time (hours)           3.8

Icelandic Evaluation Set
Evaluation set: weather information domain

Attribute              Acoustic Corpus
No. male speakers      10
No. female speakers    10
No. utterances         4000
Time (hours)           2.0

Training Text Data
The text data is as follows:
[Table, values not preserved. Columns: Corpus Set, Sentences, Words, Unique Words. Rows: ST, TRT WBW, TRT SBS]
ST: manually created Icelandic sparse text
TRT: machine translated text from the English Jupiter (*) corpus to Icelandic
* The Jupiter system, a weather information system developed by MIT

Results
[Plot annotations, in the order they appear:]
Baseline: OOV = 14.0%
6% relative WER improvement: OOV = 8.4%
15.5% relative WER improvement: OOV = 4.4%
Convergence point for the WBW MT
Still improvements
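The two quantities annotated on this plot, out-of-vocabulary (OOV) rate and relative WER improvement, follow simple formulas that can be sketched as below. The function names and the example numbers are hypothetical, and counting OOV over running words (tokens) rather than unique words is an assumption, since the slides do not say which was used.

```python
def oov_rate(vocabulary, test_words):
    """Fraction of running test words that are missing from the vocabulary."""
    if not test_words:
        raise ValueError("test_words must not be empty")
    unknown = sum(1 for word in test_words if word not in vocabulary)
    return unknown / len(test_words)


def relative_improvement(baseline_wer, new_wer):
    """Relative WER reduction: (baseline - new) / baseline."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return (baseline_wer - new_wer) / baseline_wer


if __name__ == "__main__":
    # Hypothetical vocabulary and test words, for illustration only.
    vocabulary = {"rigning", "og", "vindur"}
    test_words = ["rigning", "og", "snjór", "vindur"]
    print(oov_rate(vocabulary, test_words))

    # Hypothetical WER values in percent, not results from the slides.
    print(relative_improvement(50.0, 47.0))
```

A relative improvement is measured against the baseline error, so the same relative figure corresponds to a smaller absolute WER change when the baseline WER is lower.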

Conclusion
The results presented show that an LM can be improved considerably using either WBW or SBS translation
  – WBW improves up to 6% (relative WER)
  – SBS improves up to 15.5% (relative WER)
The word-by-word MT is especially important for resource deficient languages that do not have SBS machine translation tools available

Future Work
Large vocabulary speech recognition experiments
  – We have already collected a corpus from 20 people in the news domain
  – We plan to perform both WBW and SBS MT experiments on this corpus
Create a statistical machine translator (SBS) that is trained on sparse parallel text and then translate large documents to the target language
  – Perform sentence selection and adaptation methods on the machine translated corpora
  – Use corpora from language A to translate to language B and then finally to the target language using the WBW method
  – Etc.

Thank you for your attention. Questions?