MEMT: Multi-Engine Machine Translation 11-731 Machine Translation Alon Lavie February 19, 2007.

Slides:



Advertisements
Similar presentations
A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
Advertisements

The Application of Machine Translation in CADAL Huang Chen, Chen Haiying Zhejiang University Libraries, Hangzhou, China
The Chinese Room: Understanding and Correcting Machine Translation This work has been supported by NSF Grants IIS Solution: The Chinese Room Conclusions.
Statistical Machine Translation Part II: Word Alignments and EM Alexander Fraser ICL, U. Heidelberg CIS, LMU München Statistical Machine Translation.
Statistical Machine Translation Part II – Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart
Towards Autonomic Adaptive Scaling of General Purpose Virtual Worlds Deploying a large-scale OpenSim grid using OpenStack cloud infrastructure and Chef.
MT Evaluation: Human Measures and Assessment Methods : Machine Translation Alon Lavie February 23, 2011.
Optimal Design Laboratory | University of Michigan, Ann Arbor 2011 Design Preference Elicitation Using Efficient Global Optimization Yi Ren Panos Y. Papalambros.
Confidence Estimation for Machine Translation J. Blatz et.al, Coling 04 SSLI MTRG 11/17/2004 Takahiro Shinozaki.
Semantic text features from small world graphs Jure Leskovec, IJS + CMU John Shawe-Taylor, Southampton.
The current status of Chinese-English EBMT research -where are we now Joy, Ralf Brown, Robert Frederking, Erik Peterson Aug 2001.
TIDES MT Workshop Review. Using Syntax?  ISI-small: –Cross-lingual parsing/decoding Input: Chinese sentence + English lattice built with all possible.
1 Language Model Adaptation in Machine Translation from Speech Ivan Bulyko, Spyros Matsoukas, Richard Schwartz, Long Nguyen, and John Makhoul.
Semi-Automatic Learning of Transfer Rules for Machine Translation of Low-Density Languages Katharina Probst April 5, 2002.
MT Summit VIII, Language Technologies Institute School of Computer Science Carnegie Mellon University Pre-processing of Bilingual Corpora for Mandarin-English.
Scalable Text Mining with Sparse Generative Models
Microsoft Research Faculty Summit Robert Moore Principal Researcher Microsoft Research.
An Automatic Segmentation Method Combined with Length Descending and String Frequency Statistics for Chinese Shaohua Jiang, Yanzhong Dang Institute of.
Cluster based fact finders Manish Gupta, Yizhou Sun, Jiawei Han Feb 10, 2011.
Machine translation Context-based approach Lucia Otoyo.
Language Identification of Search Engine Queries Hakan Ceylan Yookyung Kim Department of Computer Science Yahoo! Inc. University of North Texas 2821 Mission.
English-Persian SMT Reza Saeedi 1 WTLAB Wednesday, May 25, 2011.
Statistical Machine Translation Part IV – Log-Linear Models Alex Fraser Institute for Natural Language Processing University of Stuttgart Seminar:
METEOR-Ranking & M-BLEU: Flexible Matching & Parameter Tuning for MT Evaluation Alon Lavie and Abhaya Agarwal Language Technologies Institute Carnegie.
METEOR: Metric for Evaluation of Translation with Explicit Ordering An Automatic Metric for MT Evaluation with Improved Correlations with Human Judgments.
July 24, 2007GALE Update: Alon Lavie1 Statistical Transfer and MEMT Activities Multi-Engine Machine Translation –MEMT service within the cross-GALE IOD.
Active Learning for Statistical Phrase-based Machine Translation Gholamreza Haffari Joint work with: Maxim Roy, Anoop Sarkar Simon Fraser University NAACL.
Search - on the Web and Locally Related directly to Web Search Engines: Part 1 and Part 2. IEEE Computer. June & August 2006.
Benk Erika Kelemen Zsolt
Recent Major MT Developments at CMU Briefing for Joe Olive February 5, 2008 Alon Lavie and Stephan Vogel Language Technologies Institute Carnegie Mellon.
Advanced MT Seminar Spring 2008 Instructors: Alon Lavie and Stephan Vogel.
MEMT: Multi-Engine Machine Translation Faculty: Alon Lavie, Robert Frederking, Ralf Brown, Jaime Carbonell Students: Shyamsundar Jayaraman, Satanjeev Banerjee.
Indirect Supervision Protocols for Learning in Natural Language Processing II. Learning by Inventing Binary Labels This work is supported by DARPA funding.
Making Watson Fast Daniel Brown HON111. Need for Watson to be fast to play Jeopardy successfully – All computations have to be done in a few seconds –
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Supervised Learning Resources: AG: Conditional Maximum Likelihood DP:
Image Classification for Automatic Annotation
1 Minimum Error Rate Training in Statistical Machine Translation Franz Josef Och Information Sciences Institute University of Southern California ACL 2003.
MEMT: Multi-Engine Machine Translation Faculty: Alon Lavie, Jaime Carbonell Students and Staff: Gregory Hanneman, Justin Merrill (Shyamsundar Jayaraman,
A New Approach to Utterance Verification Based on Neighborhood Information in Model Space Author :Hui Jiang, Chin-Hui Lee Reporter : 陳燦輝.
Finding document topics for improving topic segmentation Source: ACL2007 Authors: Olivier Ferret (18 route du Panorama, BP6) Reporter:Yong-Xiang Chen.
NTU & MSRA Ming-Feng Tsai
MEMT: Multi-Engine Machine Translation Guided by Explicit Word Matching Alon Lavie Language Technologies Institute Carnegie Mellon University Joint work.
Discriminative Modeling extraction Sets for Machine Translation Author John DeNero and Dan KleinUC Berkeley Presenter Justin Chiu.
Discriminative Training and Machine Learning Approaches Machine Learning Lab, Dept. of CSIE, NCKU Chih-Pin Liao.
Large Vocabulary Data Driven MT: New Developments in the CMU SMT System Stephan Vogel, Alex Waibel Work done in collaboration with: Ying Zhang, Alicia.
July 24, 2007GALE Update: Alon Lavie1 Statistical Transfer and MEMT Activities Chinese-to-English Statistical Transfer MT system (Stat-XFER) –Developed.
CMU Statistical-XFER System Hybrid “rule-based”/statistical system Scaled up version of our XFER approach developed for low-resource languages Large-coverage.
MEMT: Multi-Engine Machine Translation Guided by Explicit Word Matching Faculty: Alon Lavie, Jaime Carbonell Students and Staff: Gregory Hanneman, Justin.
1 ICASSP Paper Survey Presenter: Chen Yi-Ting. 2 Improved Spoken Document Retrieval With Dynamic Key Term Lexicon and Probabilistic Latent Semantic Analysis.
Confidence Measures As a Search Guide In Speech Recognition Sherif Abdou, Michael Scordilis Department of Electrical and Computer Engineering, University.
September 2004CSAW Extraction of Bilingual Information from Parallel Texts Mike Rosner.
MEMT: Multi-Engine Machine Translation Guided by Explicit Word Matching Alon Lavie Language Technologies Institute Carnegie Mellon University Joint work.
MEMT: Multi-Engine Machine Translation Faculty: Alon Lavie, Robert Frederking, Ralf Brown, Jaime Carbonell Students: Shyamsundar Jayaraman, Satanjeev Banerjee.
Discriminative n-gram language modeling Brian Roark, Murat Saraclar, Michael Collins Presented by Patty Liu.
Multiple-goal Search Algorithms and their Application to Web Crawling Dmitry Davidov and Shaul Markovitch Computer Science Department Technion, Haifa 32000,
Meta-Path-Based Ranking with Pseudo Relevance Feedback on Heterogeneous Graph for Citation Recommendation By: Xiaozhong Liu, Yingying Yu, Chun Guo, Yizhou.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
1 Double-Patterning Aware DSA Template Guided Cut Redistribution for Advanced 1-D Gridded Designs Zhi-Wen Lin and Yao-Wen Chang National Taiwan University.
Network Management Lecture 13. MACHINE LEARNING TECHNIQUES 2 Dr. Atiq Ahmed Université de Balouchistan.
LingWear Language Technology for the Information Warrior Alex Waibel, Lori Levin Alon Lavie, Robert Frederking Carnegie Mellon University.
1 Minimum Bayes-risk Methods in Automatic Speech Recognition Vaibhava Geol And William Byrne IBM ; Johns Hopkins University 2003 by CRC Press LLC 2005/4/26.
Multi-Engine Machine Translation
METEOR: Metric for Evaluation of Translation with Explicit Ordering An Improved Automatic Metric for MT Evaluation Alon Lavie Joint work with: Satanjeev.
Language Technologies Institute Carnegie Mellon University
Monoligual Semantic Text Alignment and its Applications in Machine Translation Alon Lavie March 29, 2012.
Neural Machine Translation by Jointly Learning to Align and Translate
Statistical Machine Translation Part III – Phrase-based SMT / Decoding
CSc4730/6730 Scientific Visualization
Jonathan Elsas LTI Student Research Symposium Sept. 14, 2007
Presenter : Jen-Wei Kuo
Presentation transcript:

MEMT: Multi-Engine Machine Translation Machine Translation Alon Lavie February 19, 2007

Feb 19, : MEMT2 Multi-Engine MT Apply several MT engines to each input in parallel Create a combined translation from the individual translations Goal is to combine strengths, and avoid weaknesses. Along all dimensions: domain limits, quality, development time/cost, run-time speed, etc. Various approaches to the problem

Feb 19, : MEMT3 Multi-Engine MT

Feb 19, : MEMT4 MEMT Goals and Challenges Scientific Challenges: –How to combine the output of multiple MT engines into a selected output that outperforms the originals in translation quality? –Synthetic combination of the output from the original systems, or just selecting the best output (on a sentence-by-sentence basis)? Engineering Challenge: –How to integrate multiple distributed translation engines and the MEMT combination engine in a common framework that supports ongoing development and evaluation

Feb 19, : MEMT5 MEMT Approaches Earliest work on MEMT in early 1990s (PANGLOSS) [pre “ROVER”] Several Main Approaches: –Classification approaches –Lattice Combination and “joint” decoding –Confusion Networks –Alignment-based Synthetic MEMT

Feb 19, : MEMT6 Classification-based Approaches Main Idea: construct a classifier that given several translations for the same input sentence selects the “best” translation (on a sentence-by-sentence basis) Main knowledge source for scoring the individual translations is a standard statistical target-language LM, plus confidence scores for each engine Example: [Tidhar & Kuessner, 2000]

Feb 19, : MEMT7 Lattice-based MEMT Earliest approach, first tried in CMU’s PANGLOSS in 1994, and still active in recent work Main Ideas: –Multiple MT engines each produce a lattice of scored translation fragments, indexed based on source language input –Lattices from all engines are combined into a global comprehensive lattice –Joint Decoder finds best translation (or n-best list) from the entries in the lattice

Lattice-based MEMT: Example El punto de descarge The drop-off point se cumplirá en will comply with el puente Agua Fria The cold Bridgewater El punto de descarge The discharge point se cumplirá en will self comply in el puente Agua Fria the “Agua Fria” bridge El punto de descarge Unload of the point se cumplirá en will take place at el puente Agua Fria the cold water of bridge

Feb 19, : MEMT9 Lattice-based MEMT Main Drawbacks: –Requires MT engines to provide lattice output  difficult to obtain! –Lattice output from all engines must be compatible: common indexing based on source word positions  difficult to standardize! –Common TM used for scoring edges may not work well for all engines –Decoding does not take into account any reinforcements from multiple engines proposing the same translation for any portion of the input

Feb 19, : MEMT10 Consensus Network Approach Main Ideas: –Collapse the collection of linear strings of translations into a minimal consensus network (“sausage” graph) that represents a finite-state automaton –Edges that are supported by multiple engines receive a score that is the sum of their contributing confidence scores –Decode: find the path through the consensus network that has optimal score –Example: [Bangalore et al, 2001]

Feb 19, : MEMT11 Consensus Network Example

Feb 19, : MEMT12 Alignment-based Synthetic MEMT Two Stage Approach: 1.Identify common words and phrases across the translations provided by the engines 2.Decode: search the space of synthetic combinations of words/phrases and select the highest scoring combined translation Example: 1.announced afghan authorities on saturday reconstituted four intergovernmental committees 2.The Afghan authorities on Saturday the formation of the four committees of government

Feb 19, : MEMT13 Alignment-based Synthetic MEMT Two Stage Approach: 1.Identify common words and phrases across the translations provided by the engines 2.Decode: search the space of synthetic combinations of words/phrases and select the highest scoring combined translation Example: 1.announced afghan authorities on saturday reconstituted four intergovernmental committees 2.The Afghan authorities on Saturday the formation of the four committees of government MEMT: the afghan authorities announced on Saturday the formation of four intergovernmental committees

Feb 19, : MEMT14 The Word Alignment Matcher Developed by Satanjeev Banerjee as a component in our METEOR Automatic MT Evaluation metric Finds maximal alignment match with minimal “crossing branches” Allows alignment of: –Identical words –Morphological variants of words –Synonymous words (based on WordNet synsets) Implementation: Clever search algorithm for best match using pruning of sub-optimal sub- solutions

Feb 19, : MEMT15 Matcher Example the sri lanka prime minister criticizes the leader of the country President of Sri Lanka criticized by the country’s Prime Minister

Feb 19, : MEMT16 The MEMT Decoder Algorithm Algorithm builds collections of partial hypotheses of increasing length Partial hypotheses are extended by selecting the “next available” word from one of the original systems Sentences are assumed mostly synchronous: –Each word is either aligned with another word or is an alternative of another word Extending a partial hypothesis with a word “pulls” and “uses” its aligned words with it, and marks its alternatives as “used” Partial hypotheses are scored and ranked Pruning and re-combination Hypothesis can end if any original system proposes an end of sentence as next word

Feb 19, : MEMT17 Scoring MEMT Hypotheses Scoring: –Word confidence score [0,1] based on engine confidence and reinforcement from alignments of the words –LM score based on suffix-array 6-gram LM –Log-linear combination: weighted sum of logs of confidence score and LM score –Select best scoring hypothesis based on: Total score (bias towards shorter hypotheses) Average score per word

Feb 19, : MEMT18 The MEMT Algorithm: Further Issues Parameters: –“lingering word” horizon: how long is a word allowed to linger when words following it have already been used? –“lookahead” horizon: how far ahead can we look for an alternative for a word that is not aligned? –“POS matching”: limit search for an alternative to only words of the same POS –“Chunking”: phrases in an engine can be marked as “chunks” that should not be broken apart

Feb 19, : MEMT19 Example IBM: korea stands ready to allow visits to verify that it does not manufacture nuclear weapons ISI: North Korea Is Prepared to Allow Washington to Verify that It Does Not Make Nuclear Weapons CMU: North Korea prepared to allow Washington to the verification of that is to manufacture nuclear weapons Selected MEMT Sentence : north korea is prepared to allow washington to verify that it does not manufacture nuclear weapons ( )

Feb 19, : MEMT20 System Development and Testing Initial development tests performed on TIDES 2003 Arabic-to-English MT data, using IBM, ISI and CMU SMT system output Preliminary evaluation tests performed on three Arabic-to-English systems and on three Chinese-to-English COTS systems Recent Deployments: –GALE Interoperability Operational Demo (IOD): combining output from IBM, LW and RWTH MT systems –Used in joint ARL/CMU submission to MT Eval-06: combining output from several ARL (mostly) rule- based systems

Feb 19, : MEMT21 Internal Experimental Results: MT-Eval-03 Set Arabic-to-English SystemMETEOR Score System A.4241 System B.4231 System C.4405 Choosing best online translation.4432 MEMT.5185 Best hypothesis generated by MEMT.5883

Feb 19, : MEMT22 ARL/CMU MEMT MT-Eval-06 Results Arabic-to-English BLEUMETEORNISTTER Best Individual Engine Best MEMT BLEUMETEORNISTTER Best Individual Engine Best MEMT NIST Set: GALE Set:

Feb 19, : MEMT23 Architecture and Engineering Challenge: How do we construct an effective architecture for running MEMT within large- scale distributed projects? –Example: GALE Project –Multiple MT engines running at different locations –Input may be text or output of speech recognizers, Output may go downstream to other applications (IE, Summarization, TDT) Approach: Using IBM’s UIMA: Unstructured Information Management Architecture –Provides support for building robust processing “workflows” with heterogeneous components –Components act as “annotators” at the character level within documents

Feb 19, : MEMT24 UIMA-based MEMT MEMT engine set up as a remote server: –Communication over socket connections –Sentence-by-sentence translation Java “wrapper” turns the MEMT service into a UIMA-style annotator component UIMA supports easy integration of the MEMT component into various processing “workflows”: –Input is a “document” annotated with multiple translations –Output is the same “document” with an additional MEMT annotation

Feb 19, : MEMT25 Conclusions New sentence-level MEMT approach with nice properties and encouraging performance results: –15% improvement in initial studies –5-30% improvement in MT-Eval-06 setup Easy to run on both research and COTS systems UIMA-based architecture design for effective integration in large distributed systems/projects –Pilot study has been very positive –Can serve as a model for integration framework(s) under GALE and other projects

Feb 19, : MEMT26 Open Research Issues Main Open Research Issues: –Improvements to the underlying algorithm: Better word and phrase alignments Larger search spaces –Confidence scores at the sentence or word/phrase level –Engines providing phrasal information –Decoding is still suboptimal Oracle scores show there is much room for improvement Need for additional discriminant features Stronger (more discriminant) LMs

Feb 19, : MEMT27 Demo

Feb 19, : MEMT28 References 1994, Frederking, R. and S. Nirenburg. “Three Heads are Better than One”. In Proceedings of the Fourth Conference on Applied Natural Language Processing (ANLP-94), Stuttgart, Germany. 2000, Tidhar, Dan and U. Kessner. “Learning to Select a Good Translation”. In Proceedings of the 17th International Conference on Computational Linguistics (COLING-2000), Saarbrcken, Germany. 2001, Bangalore, S., G. Bordel, and G. Riccardi. “Computing Consensus Translation from Multiple Machine Translation Systems”. In Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop, Italy. 2005, Jayaraman, S. and A. Lavie. "Multi-Engine Machine Translation Guided by Explicit Word Matching". In Companion Volume of Proceedings of the 43th Annual Meeting of the Association of Computational Linguistics (ACL-2005), Ann Arbor, Michigan, June 2005.Jayaraman, S. and A. Lavie. "Multi-Engine Machine Translation Guided by Explicit Word Matching" 2005, Jayaraman, S. and A. Lavie. "Multi-Engine Machine Translation Guided by Explicit Word Matching". In Proceedings of the 10th Annual Conference of the European Association for Machine Translation (EAMT-2005), Budapest, Hungary, May 2005.Jayaraman, S. and A. Lavie. "Multi-Engine Machine Translation Guided by Explicit Word Matching"

Feb 19, : MEMT29 Example IBM: victims russians are one man and his wife and abusing their eight year old daughter plus a ( 11 and 7 years ) man and his wife and driver, egyptian nationality. : ISI: The victims were Russian man and his wife, daughter of the most from the age of eight years in addition to the young girls ) 11 7 years ( and a man and his wife and the bus driver Egyptian nationality. : CMU: the victims Cruz man who wife and daughter both critical of the eight years old addition to two Orient ( 11 ) 7 years ) woman, wife of bus drivers Egyptian nationality. : MEMT Sentence : Selected : the victims were russian man and his wife and daughter of the eight years from the age of a 11 and 7 years in addition to man and his wife and bus drivers egyptian nationality Oracle : the victims were russian man and wife and his daughter of the eight years old from the age of a 11 and 7 years in addition to the man and his wife and bus drivers egyptian nationality young girls

Feb 19, : MEMT30 Example IBM: the sri lankan prime minister criticizes head of the country's : ISI: The President of the Sri Lankan Prime Minister Criticized the President of the Country : CMU: Lankan Prime Minister criticizes her country: MEMT Sentence : Selected: the sri lankan prime minister criticizes president of the country Oracle: the sri lankan prime minister criticizes president of the country's