Monoligual Semantic Text Alignment and its Applications in Machine Translation Alon Lavie March 29, 2012.

Slides:



Advertisements
Similar presentations
Statistical Machine Translation
Advertisements

The Chinese Room: Understanding and Correcting Machine Translation This work has been supported by NSF Grants IIS Solution: The Chinese Room Conclusions.
Statistical Machine Translation Part II: Word Alignments and EM Alexander Fraser Institute for Natural Language Processing University of Stuttgart
Statistical Machine Translation Part II: Word Alignments and EM Alexander Fraser ICL, U. Heidelberg CIS, LMU München Statistical Machine Translation.
Help communities share knowledge more effectively across the language barrier Automated Community Content Editing PorTal.
Statistical Machine Translation Part II – Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart
MEANT: semi-automatic metric for evaluating for MT evaluation via semantic frames an asembling of ACL11,IJCAI11,SSST11 Chi-kiu Lo & Dekai Wu Presented.
TAP-ET: TRANSLATION ADEQUACY AND PREFERENCE EVALUATION TOOL Mark Przybocki, Kay Peterson, Sébastien Bronsart May LREC 2008 Marrakech, Morocco.
MT Evaluation: Human Measures and Assessment Methods : Machine Translation Alon Lavie February 23, 2011.
Improving Machine Translation Quality via Hybrid Systems and Refined Evaluation Methods Andreas Eisele DFKI GmbH and Saarland University Helsinki, November.
Chinese Word Segmentation Method for Domain-Special Machine Translation Su Chen; Zhang Yujie; Guo Zhen; Xu Jin’an Beijing Jiaotong University.
Confidence Estimation for Machine Translation J. Blatz et.al, Coling 04 SSLI MTRG 11/17/2004 Takahiro Shinozaki.
Symmetric Probabilistic Alignment Jae Dong Kim Committee: Jaime G. Carbonell Ralf D. Brown Peter J. Jansen.
1 The Web as a Parallel Corpus  Parallel corpora are useful  Training data for statistical MT  Lexical correspondences for cross-lingual IR  Early.
Microsoft Research Faculty Summit Robert Moore Principal Researcher Microsoft Research.
© 2014 The MITRE Corporation. All rights reserved. Stacey Bailey and Keith Miller On the Value of Machine Translation Adaptation LREC Workshop: Automatic.
A New Approach for Cross- Language Plagiarism Analysis Rafael Corezola Pereira, Viviane P. Moreira, and Renata Galante Universidade Federal do Rio Grande.
Evaluating the Contribution of EuroWordNet and Word Sense Disambiguation to Cross-Language Information Retrieval Paul Clough 1 and Mark Stevenson 2 Department.
English-Persian SMT Reza Saeedi 1 WTLAB Wednesday, May 25, 2011.
Query Rewriting Using Monolingual Statistical Machine Translation Stefan Riezler Yi Liu Google 2010 Association for Computational Linguistics.
Achieving Domain Specificity in SMT without Over Siloing William Lewis, Chris Wendt, David Bullock Microsoft Research Machine Translation.
Automated Metrics for MT Evaluation : Machine Translation Alon Lavie March 2, 2011.
Evaluation of the Statistical Machine Translation Service for Croatian-English Marija Brkić Department of Informatics, University of Rijeka
METEOR-Ranking & M-BLEU: Flexible Matching & Parameter Tuning for MT Evaluation Alon Lavie and Abhaya Agarwal Language Technologies Institute Carnegie.
METEOR: Metric for Evaluation of Translation with Explicit Ordering An Automatic Metric for MT Evaluation with Improved Correlations with Human Judgments.
An Algorithm to Align Words for Historical Comparison Michael A. Covington (The University of Georgia) Journal of Computational Linguistics 1996 February.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
2010 Failures in Czech-English Phrase-Based MT 2010 Failures in Czech-English Phrase-Based MT Full text, acknowledgement and the list of references in.
MIRACLE Multilingual Information RetrievAl for the CLEF campaign DAEDALUS – Data, Decisions and Language, S.A. Universidad Carlos III de.
Recent Major MT Developments at CMU Briefing for Joe Olive February 5, 2008 Alon Lavie and Stephan Vogel Language Technologies Institute Carnegie Mellon.
A daptable A utomatic E valuation M etrics for M achine T ranslation L ucian V lad L ita joint work with A lon L avie and M onica R ogati.
Automatic Post-editing (pilot) Task Rajen Chatterjee, Matteo Negri and Marco Turchi Fondazione Bruno Kessler [ chatterjee | negri | turchi
Advanced MT Seminar Spring 2008 Instructors: Alon Lavie and Stephan Vogel.
Gerrit Schutte OHIM 9th of December, 2011 Trademark terminology control.
Approximating a Deep-Syntactic Metric for MT Evaluation and Tuning Matouš Macháček, Ondřej Bojar; {machacek, Charles University.
MEMT: Multi-Engine Machine Translation Faculty: Alon Lavie, Robert Frederking, Ralf Brown, Jaime Carbonell Students: Shyamsundar Jayaraman, Satanjeev Banerjee.
Korea Maritime and Ocean University NLP Jung Tae LEE
Chinese Word Segmentation Adaptation for Statistical Machine Translation Hailong Cao, Masao Utiyama and Eiichiro Sumita Language Translation Group NICT&ATR.
LREC 2008 Marrakech 29 May Caroline Lavecchia, Kamel Smaïli and David Langlois LORIA / Groupe Parole, Vandoeuvre-Lès-Nancy, France Phrase-Based Machine.
 Programming - the process of creating computer programs.
Towards the Use of Linguistic Information in Automatic MT Evaluation Metrics Projecte de Tesi Elisabet Comelles Directores Irene Castellon i Victoria Arranz.
1 Minimum Error Rate Training in Statistical Machine Translation Franz Josef Och Information Sciences Institute University of Southern California ACL 2003.
Event-Based Extractive Summarization E. Filatova and V. Hatzivassiloglou Department of Computer Science Columbia University (ACL 2004)
MEMT: Multi-Engine Machine Translation Guided by Explicit Word Matching Alon Lavie Language Technologies Institute Carnegie Mellon University Joint work.
Statistical Machine Translation Part II: Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart
Discriminative Modeling extraction Sets for Machine Translation Author John DeNero and Dan KleinUC Berkeley Presenter Justin Chiu.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
MEMT: Multi-Engine Machine Translation Guided by Explicit Word Matching Faculty: Alon Lavie, Jaime Carbonell Students and Staff: Gregory Hanneman, Justin.
MEMT: Multi-Engine Machine Translation Guided by Explicit Word Matching Alon Lavie Language Technologies Institute Carnegie Mellon University Joint work.
MEMT: Multi-Engine Machine Translation Faculty: Alon Lavie, Robert Frederking, Ralf Brown, Jaime Carbonell Students: Shyamsundar Jayaraman, Satanjeev Banerjee.
Review: Review: Translating without in-domain corpus: Machine translation post-editing with online learning techniques Antonio L. Lagarda, Daniel Ortiz-Martínez,
LingWear Language Technology for the Information Warrior Alex Waibel, Lori Levin Alon Lavie, Robert Frederking Carnegie Mellon University.
This has been created by QA InfoTech. Choose QA InfoTech as your Automated testing partner. Visit for more information.
Ling 575: Machine Translation Yuval Marton Winter 2016 February 9: MT Evaluation Much of the materials was borrowed from course slides of Chris Callison-Burch.
Is Neural Machine Translation the New State of the Art?
A CASE STUDY OF GERMAN INTO ENGLISH BY MACHINE TRANSLATION: MOSES EVALUATED USING MOSES FOR MERE MORTALS. Roger Haycock 
Machine Translation Course 9
Multi-Engine Machine Translation
Centre for Translation Studies FACULTY OF ARTS
Statistical Machine Translation Part II: Word Alignments and EM
METEOR: Metric for Evaluation of Translation with Explicit Ordering An Improved Automatic Metric for MT Evaluation Alon Lavie Joint work with: Satanjeev.
Approaches to Machine Translation
Vorlesung Maschinelle Übersetzung, SS 2010
Language Technologies Institute Carnegie Mellon University
The ACCEPT Project Enabling machine translation for the emerging community content paradigm. Allowing citizens across the EU better access to communities.
Erasmus University Rotterdam
Suggestions for Class Projects
--Mengxue Zhang, Qingyang Li
Approaches to Machine Translation
Statistical Machine Translation Papers from COLING 2004
Presentation transcript:

Monoligual Semantic Text Alignment and its Applications in Machine Translation Alon Lavie March 29, 2012

Monolingual Semantic Text Alignment Input: two (semantically-comparable) text strings Output: Most likely word (or multi-word) alignment correspondence between the two strings Example: Applications in MT: Automated MT Evaluation Metrics (METEOR) Automated MT Error Analysis (METEOR) MT System Combination (Multi-Engine MT) Automated MT Post-Editing MT System Optimization March 29, 2012 Monolingual Text Alignment

Monolingual Semantic Text Alignment Monolingual Text Aligner originally developed as an integral component of our METEOR MT Evaluation Metric Input: Sentence-level MT-generated translation and a human-generated reference translation Goal: Assess and assign a quality score to the MT-generated translation based on how similar (in meaning) it is to the reference Address translation variability by explicitly identifying different types of semantically-equivalent correspondences: Exact matches Morphological variants Synonyms Paraphrases Explicit alignment provides information on differences in word-ordering and other characteristics of correspondence beyond short n-gram matches March 29, 2012 Monolingual Text Alignment

Monolingual Semantic Text Alignment Semantic-similarity Information Sources: Porter Stemmers (morphological variants) WordNet (synonyms) Paraphrase Tables (statistically-derived, filtered) Alignment Algorithm: Identify all matches of all types Bipartite Match: each word (or multi-word) can match at most once Single-pass approximate search for the maximal alignment with minimal crossing branches March 29, 2012 Monolingual Text Alignment

Monolingual Text Alignment METEOR METEOR = Metric for Evaluation of Translation with Explicit Ordering [Lavie and Denkowski, 2010] [Denkowski and Lavie, 2011] Main ideas: Combine Recall and Precision as weighted score components Look only at unigram Precision and Recall Align MT output with each reference individually and take score of best pairing Matching takes into account translation variability via word inflection variations, synonymy and paraphrasing matches Addresses fluency via a direct penalty for word order: how fragmented is the matching of the MT output with the reference? Parameters of metric components are tunable to maximize the score correlations with human judgments for each language METEOR has been shown to consistently outperform BLEU and other metrics in correlation with human judgments March 29, 2012 Monolingual Text Alignment

Monolingual Text Alignment METEOR Scoring March 29, 2012 Monolingual Text Alignment

Monolingual Text Alignment METEOR Scoring March 29, 2012 Monolingual Text Alignment

Monolingual Text Alignment METEOR Scoring March 29, 2012 Monolingual Text Alignment

METEOR Parameter Tuning METEOR has several “free” parameters that can be optimized to maximize correlation with different notions of human judgments Alpha, Beta and Gamma control overall metric behavior Tunable weights for different types of matches Latest version (1.3) distinguishes and tunes separate weights for content and function words Optimized for Adequacy, Fluency, A+F, Rankings, and Post-Editing effort for English on available development data Optimized independently for different target languages Limited number of parameters means that optimization can be done by full exhaustive search of the parameter space March 29, 2012 Monolingual Text Alignment

METEOR Parameter Tuning March 29, 2012 Monolingual Text Alignment

Monolingual Text Alignment METEOR Analysis Tools METEOR v1.3 comes with a suite of analysis and visualization tools called METEOR-XRAY March 29, 2012 Monolingual Text Alignment

Monolingual Text Alignment METEOR Analysis Tools March 29, 2012 Monolingual Text Alignment

MT System Combination Idea: apply several MT engines to each input in parallel and combine their output translations Goal: leverage the strengths and diversity of different MT engines to generate an improved translation system Particularly useful in assimilation scenarios where input is uncontrolled and diverse in domain, genre, style or other characteristics Can result in significant gains in translation quality 13

CMU’s Alignment-based Multi-Engine System Combination Works with any MT engines Assumes original MT systems are “black-boxes” – no internal information other than the translations themselves Explores broader search spaces than other MT system combination approaches using linguistically-based and statistical features Achieves state-of-the-art performance in competitive research evaluations of recent years Developed over last seven years under research funding from several government grants (DARPA, DoD and NSF) 14

Alignment-based MEMT Two Stage Approach: Example: Align: Identify and align equivalent words and phrases across the translations provided by the engines Decode: search the space of synthetic combinations of words/phrases and select the highest scoring combined translation Example: announced afghan authorities on saturday reconstituted four intergovernmental committees The Afghan authorities on Saturday the formation of the four committees of government 15

Alignment-based MEMT Two Stage Approach: Example: Align: Identify and align equivalent words and phrases across the translations provided by the engines Decode: search the space of synthetic combinations of words/phrases and select the highest scoring combined translation Example: announced afghan authorities on saturday reconstituted four intergovernmental committees The Afghan authorities on Saturday the formation of the four committees of government MEMT: the afghan authorities announced on Saturday the formation of four intergovernmental committees 16

The MEMT Decoder Algorithm Search-space of system combination hypotheses implicitly defined by the initial alignment stage, and partially explored Search-space is controlled by linguistic similarity features Algorithm builds collections of partial hypotheses of increasing length Partial hypotheses are extended by selecting the “next available” word from one of the original systems Extending a partial hypothesis with a word marks the word as “used” and marks its aligned words as also “used” Partial hypotheses are scored and ranked Pruning and re-combination for efficiency Hypothesis can end if any original system proposes an end of sentence as next word 17

Recent Performance Results NIST-2009 and WMT-2009 18

Recent Performance Results WMT-2011 19

Multi-Engine Human Translation Translation crowdsourcing using untrained bilingual labor is error-prone Idea: combine the output of multiple human translators into a consensus translation using MT system combination methods [Ambati 2011] March 29, 2012 Monolingual Text Alignment

Smoothing MERT in SMT [Cettolo, Bertoldi and Federico 2011] Interesting application of MT system combination to overcome instability of MERT optimization in SMT Perform MERT multiple times Use the CMU MEMT system to combine the different instances of the same MT system 21

MT Automated Post-Editing Idea: Target-language-side adaptation and refinement method Adapt and specialize the output of a single “baseline” cross-lingual MT system to specific data characteristics: genre, domain, dialect, etc. Particularly useful in situations where each specialization has only limited amounts of data available Approach: Train a secondary “translation” system from baseline MT system target-language output to the specialized target output Monolingual text alignment is a highly-effective method for aligning the monolingual “parallel” training data Used extensively by Safaba Translation Solutions for client-specific MT adaptation Goal: minimize the amount of human post-editing required on client-specific MT output March 29, 2012 Monolingual Text Alignment

Safaba Two-Stage MT Approach Baseline MT Target Language Text Client-Adapted Target Language Text Source Language Text Baseline MT Engine Automated Post-Editing (APE) Engine Safaba or Partner-developed (SMT or RBMT) Safaba-developed (Moses-based) Client + General Data Client Data

MT System Optimization Goals: Optimize MT systems to minimize important types of translation errors and maximize utility of generated MT (i.e. who did what to whom) Minimize the amount of human post-editing effort required for correcting the MT output Approach: Develop the capabilities to automatically identify and quantify different types of MT errors (important content words, Named Entities, etc.) Use the above capability to analyze in depth the error characteristics of different types of MT systems (phrase-based, Hierarchical, Syntax- based, etc.) at segment-level granularity Develop an automated evaluation metric that can be used as an effective optimization function in advanced MT tuning optimizers such as PRO and MIRA Existing metrics such as HTER and METEOR 1.3 are starting points Monolingual Semantic Text Alignment is a critical first step

Post-Editing Pilot Study Pilot study conducted with colleagues at Kent State IAL Goals: Develop post-editing analysis infrastructure and identify procedures and issues Conduct a preliminary human post-editing study and analyze post-editing behavior, issues and outcomes Pilot Setup: English-to-Spanish Two MT systems (Microsoft and Safaba) Five short texts (~300 words), computer software documentation domain 20 trials: 5 texts x 2 MT systems x 2 post-edit translations Seven bilingual translator post-editors each post-editing three MT docs Collected all post-editing output + timing information + complete keystroke logger information Expert translator generated segment-level human assessments on a 4-point scale [1= no edit; 2 = minor edit; 3 = major edit; 4 = complete rewrite] Analysis currently underway March 29, 2012 Monolingual Text Alignment

Questions