NUDT Machine Translation System for IWSLT2007 Presenter: Boxing Chen Authors: Wen-Han Chao & Zhou-Jun Li National University of Defense Technology, China.

Slides:



Advertisements
Similar presentations
Statistical Machine Translation
Advertisements

The Application of Machine Translation in CADAL Huang Chen, Chen Haiying Zhejiang University Libraries, Hangzhou, China
Statistical Machine Translation Part II: Word Alignments and EM Alexander Fraser Institute for Natural Language Processing University of Stuttgart
Statistical Machine Translation Part II – Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart
Re-evaluating Bleu Alison Alvarez Machine Translation Seminar February 16, 2006.
Dependency-Based Automatic Evaluation for Machine Translation Karolina Owczarzak, Josef van Genabith, Andy Way National Centre for Language Technology.
Word Sense Disambiguation for Machine Translation Han-Bin Chen
Hybridity in MT: Experiments on the Europarl Corpus Declan Groves 24 th May, NCLT Seminar Series 2006.
1 A Tree Sequence Alignment- based Tree-to-Tree Translation Model Authors: Min Zhang, Hongfei Jiang, Aiti Aw, et al. Reporter: 江欣倩 Professor: 陳嘉平.
A Tree-to-Tree Alignment- based Model for Statistical Machine Translation Authors: Min ZHANG, Hongfei JIANG, Ai Ti AW, Jun SUN, Sheng LI, Chew Lim TAN.
Flow Network Models for Sub-Sentential Alignment Ying Zhang (Joy) Advisor: Ralf Brown Dec 18 th, 2001.
Novel Reordering Approaches in Phrase-Based Statistical Machine Translation S. Kanthak, D. Vilar, E. Matusov, R. Zens & H. Ney ACL Workshop on Building.
Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.
“Applying Morphology Generation Models to Machine Translation” By Kristina Toutanova, Hisami Suzuki, Achim Ruopp (Microsoft Research). UW Machine Translation.
TIDES MT Workshop Review. Using Syntax?  ISI-small: –Cross-lingual parsing/decoding Input: Chinese sentence + English lattice built with all possible.
Minimum Error Rate Training in Statistical Machine Translation By: Franz Och, 2003 Presented By: Anna Tinnemore, 2006.
Course Summary LING 575 Fei Xia 03/06/07. Outline Introduction to MT: 1 Major approaches –SMT: 3 –Transfer-based MT: 2 –Hybrid systems: 2 Other topics.
Machine Translation A Presentation by: Julie Conlonova, Rob Chase, and Eric Pomerleau.
Symmetric Probabilistic Alignment Jae Dong Kim Committee: Jaime G. Carbonell Ralf D. Brown Peter J. Jansen.
MT Summit VIII, Language Technologies Institute School of Computer Science Carnegie Mellon University Pre-processing of Bilingual Corpora for Mandarin-English.
A Hierarchical Phrase-Based Model for Statistical Machine Translation Author: David Chiang Presented by Achim Ruopp Formulas/illustrations/numbers extracted.
Corpora and Translation Parallel corpora Statistical MT (not to mention: Corpus of translated text, for translation studies)
9/12/2003LTI Student Research Symposium1 An Integrated Phrase Segmentation/Alignment Algorithm for Statistical Machine Translation Joy Advisor: Stephan.
Does Syntactic Knowledge help English- Hindi SMT ? Avinesh. PVS. K. Taraka Rama, Karthik Gali.
Statistical Machine Translation Part IX – Better Word Alignment, Morphology and Syntax Alexander Fraser ICL, U. Heidelberg CIS, LMU München
Statistical Machine Translation Part VIII – Log-Linear Models Alexander Fraser ICL, U. Heidelberg CIS, LMU München Statistical Machine Translation.
Statistical Machine Translation Part III – Phrase-based SMT / Decoding Alex Fraser Institute for Natural Language Processing University of Stuttgart
English-Persian SMT Reza Saeedi 1 WTLAB Wednesday, May 25, 2011.
Technical Report of NEUNLPLab System for CWMT08 Xiao Tong, Chen Rushan, Li Tianning, Ren Feiliang, Zhang Zhuyu, Zhu Jingbo, Wang Huizhen
Tree Kernels for Parsing: (Collins & Duffy, 2001) Advanced Statistical Methods in NLP Ling 572 February 28, 2012.
Direct Translation Approaches: Statistical Machine Translation
Syntax for MT EECS 767 Feb. 1, Outline Motivation Syntax-based translation model  Formalization  Training Using syntax in MT  Using multiple.
Statistical Machine Translation Part IV – Log-Linear Models Alex Fraser Institute for Natural Language Processing University of Stuttgart Seminar:
Statistical Machine Translation Part V - Advanced Topics Alex Fraser Institute for Natural Language Processing University of Stuttgart Seminar:
An Integrated Approach for Arabic-English Named Entity Translation Hany Hassan IBM Cairo Technology Development Center Jeffrey Sorensen IBM T.J. Watson.
Grammatical Machine Translation Stefan Riezler & John Maxwell.
Advanced Signal Processing 05/06 Reinisch Bernhard Statistical Machine Translation Phrase Based Model.
Statistical Machine Translation Part IV – Log-Linear Models Alexander Fraser Institute for Natural Language Processing University of Stuttgart
Sanjay Chatterji Dev shri Roy Sudeshna Sarkar Anupam Basu CSE, IIT Kharagpur A Hybrid Approach for Bengali to Hindi Machine Translation.
Statistical Machine Translation Part V – Better Word Alignment, Morphology and Syntax Alexander Fraser CIS, LMU München Seminar: Open Source.
2010 Failures in Czech-English Phrase-Based MT 2010 Failures in Czech-English Phrase-Based MT Full text, acknowledgement and the list of references in.
A daptable A utomatic E valuation M etrics for M achine T ranslation L ucian V lad L ita joint work with A lon L avie and M onica R ogati.
The ICT Statistical Machine Translation Systems for IWSLT 2007 Zhongjun He, Haitao Mi, Yang Liu, Devi Xiong, Weihua Luo, Yun Huang, Zhixiang Ren, Yajuan.
Reordering Model Using Syntactic Information of a Source Tree for Statistical Machine Translation Kei Hashimoto, Hirohumi Yamamoto, Hideo Okuma, Eiichiro.
Compact WFSA based Language Model and Its Application in Statistical Machine Translation Xiaoyin Fu, Wei Wei, Shixiang Lu, Dengfeng Ke, Bo Xu Interactive.
What’s in a translation rule? Paper by Galley, Hopkins, Knight & Marcu Presentation By: Behrang Mohit.
Chinese Word Segmentation Adaptation for Statistical Machine Translation Hailong Cao, Masao Utiyama and Eiichiro Sumita Language Translation Group NICT&ATR.
Alignment of Bilingual Named Entities in Parallel Corpora Using Statistical Model Chun-Jen Lee Jason S. Chang Thomas C. Chuang AMTA 2004.
NRC Report Conclusion Tu Zhaopeng NIST06  The Portage System  For Chinese large-track entry, used simple, but carefully- tuned, phrase-based.
LREC 2008 Marrakech 29 May Caroline Lavecchia, Kamel Smaïli and David Langlois LORIA / Groupe Parole, Vandoeuvre-Lès-Nancy, France Phrase-Based Machine.
A non-contiguous Tree Sequence Alignment-based Model for Statistical Machine Translation Jun Sun ┼, Min Zhang ╪, Chew Lim Tan ┼ ┼╪
Haitham Elmarakeby.  Speech recognition
Imposing Constraints from the Source Tree on ITG Constraints for SMT Hirofumi Yamamoto, Hideo Okuma, Eiichiro Sumita National Institute of Information.
1 Minimum Error Rate Training in Statistical Machine Translation Franz Josef Och Information Sciences Institute University of Southern California ACL 2003.
NLP. Machine Translation Tree-to-tree – Yamada and Knight Phrase-based – Och and Ney Syntax-based – Och et al. Alignment templates – Och and Ney.
Statistical Machine Translation Part II: Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart
Discriminative Modeling extraction Sets for Machine Translation Author John DeNero and Dan KleinUC Berkeley Presenter Justin Chiu.
Towards Syntactically Constrained Statistical Word Alignment Greg Hanneman : Advanced Machine Translation Seminar April 30, 2008.
Large Vocabulary Data Driven MT: New Developments in the CMU SMT System Stephan Vogel, Alex Waibel Work done in collaboration with: Ying Zhang, Alicia.
October 10, 2003BLTS Kickoff Meeting1 Transfer with Strong Decoding Learning Module Transfer Rules {PP,4894} ;;Score: PP::PP [NP POSTP] -> [PREP.
A Syntax-Driven Bracketing Model for Phrase-Based Translation Deyi Xiong, et al. ACL 2009.
A Simple English-to-Punjabi Translation System By : Shailendra Singh.
Review: Review: Translating without in-domain corpus: Machine translation post-editing with online learning techniques Antonio L. Lagarda, Daniel Ortiz-Martínez,
LING 575 Lecture 5 Kristina Toutanova MSR & UW April 27, 2010 With materials borrowed from Philip Koehn, Chris Quirk, David Chiang, Dekai Wu, Aria Haghighi.
Neural Machine Translation
Tools for Natural Language Processing Applications
Suggestions for Class Projects
Statistical Machine Translation Papers from COLING 2004
The XMU SMT System for IWSLT 2007
Dekai Wu Presented by David Goss-Grubbs
Presentation transcript:

NUDT Machine Translation System for IWSLT2007 Presenter: Boxing Chen Authors: Wen-Han Chao & Zhou-Jun Li National University of Defense Technology, China

Outline System Description Word Alignment Phrase Pair Extracting Example-Based Decoder Experiments Future Works

System Description Three Main Components Word Alignment Phrase Pair Extraction Example-based Decoder

Word Alignment - (1) DP algorithm and log-linear model to compute the word alignment Feature functions ITG constraint Word co-occurrence probability Distortion model Word alignment a binary branching tree satisfy ITG constraint many-to-many alignment

Word Alignment - (2)

Phrase Pair Extraction Extracting phrase pairs each constituent sub-tree is a phrase pair (block) Building models Direct and inverse word- and phrase-based TM Reordering model over all block pairs Chinese English 我 i 再次 once again 检查 checked 我 的我 的 my 包 Bag 我的 包 my bag 检查 我的 包 checked my bag 再次 检查 我的 包 checked my bag once again 我 再次 检查 我的 包 i checked my bag once again

Decoding Log-linear model used to score the partial hypotheses Baseline: CKY style decoder Example-based decoder Examples retrieval Generation: matching and merging

Example-based Decoder - (1) Examples retrieval Collect source blocks from input Collect (top N) phrase pairs from the TM for each source block If an example (sentence-pair) can be matched by at least one phrase-pair, it is a valid example. Keep top M examples for a specific phrase- pair

Example-based Decoder - (2) Matching Matching the input with each example get a translation template the translation template forms a new ITG tree. Decoding each un-translated string (child input) iteratively get a set of child translation templates.

Example-based Decoder - (3)

Example-based Decoder - (4)

Example-based Decoder - (5)

Example-based Decoder - (6) Merging merging the child templates with the parent templates. consider the child template as a sub-tree merge it with the parent ITG tree.

Example-based Decoder - (7)

Example-based Decoder - (8) beam search scoring function for each translation template is:

Experiments-(1) We take part in the C-E task Training corpus 39,953 sentence pairs development set IWSLT07_devset3_* 506 Chinese sentences and 16 references test set IWSLT07_devset2_* 500 Chinese sentences and 16 references official test set 489 Chinese sentences

Experiments-(2) Preprocessing tokenization lowercasing Stemming by using a morphological dictionary ChineseEnglish (stemmed) Train. Data Sentences39,963 Words351,060377,890 Vocabulary11,3027,610 Dev. Set Sentences506 Words3,826 Test Set Sentences489 Words3,189

Experiments-(3) Baseline CKY decoder Results for the stemmed corpus Decoder Bleu Dev setTest Set Official Test set CKY- Decoder EB- Decoder

Experiments-(4) Post-Processing Morphological changes using a morphological dictionary and 3-gram LM Case sensitive outputs: two simple rules Uppercasing the first word of a sentence Changing the word “ i ” to “ I ”

Experiments-(5) Reasons for the result the data becomes more sparse the post-processing is too simple Decoder Bleu Dev set Test set Official Test set CKY-Decoder EB-Decoder

Future Works Better examples retrieval algorithm. More feature functions in log-linear model. Improving morphological generation.

Thank Dr. Boxing Chen for the presentation and the useful suggestions Thank you for your attention. Any Questions?

Main References Philipp Koehn, Franz Josef Och and Daniel Marcu: “ Statistical Phrase-Based Translation ”. In NAACL/HLT 2003, pages (2003) David Chiang: “ A Hierarchical Phrase-Based Model for Statistical Machine Translation ”. In Proc. of ACL 2005, pages 263 – 270 (2005) Franz Joseph Och and Hermann Ney: “ Discriminative training and maximum entropy models for statistical machine translation ”. In Proceedings of the 40th Annual Meeting of the ACL, pp. 295 – 302(2002) Dekai Wu: “ Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora ”. Computational Linguistics, 23(3):374(1997) Franz Joseph Och and Hermann Ney: “ A Systematic Comparison of Various Statistical Alignment Models ”. Computational Linguistics, 29(1):19 – 52, March(2003) Wen-Han Chao and Zhou-Jun Li: “ Incorporating Constituent Structure Constraint into Discriminative Word Alignment ”, MT Summit XI, Copenhagen, Denmark, September 10-14, 2007, accepted. (2007) R. Moore. “ A discriminative framework for bilingual word alignment ”. In Proceedings of HLT-EMNLP, pages 81 – 88, Vancouver, Canada, October. (2005) I. Dan Melamed. “ Models of Translational Equivalence among Words ”. Computational Linguistics, 26(2):221 – 249. (2000) Taro Watanabe and Eiichiro Sumita: “ Example-based Decoding for Statistical Machine Translation ”. In Machine Translation Summit IX pp (2003) Kishore Papineni, Salim Roukos, Todd Ward and Wei-Jing Zhu: “ BLEU: a Method for Automatic Evaluation of Machine Translation ”. In Proceedings of the 40th Annual Meeting of the Association fo Computational Linguistics(ACL), Philadelphia, July 2002, pp (2002) A. Stolcke, “ SRILM – An extensible language modeling toolkit, ” in Proceedings of the International Conference on Spoken Language Processing, Denver, Colorado, 2002, pp. 901 – 904