Korea Maritime and Ocean University NLP Jung Tae LEE

Slides:

Advertisements

Similar presentations

Statistical Machine Translation

Advertisements

Mustafa Cayci INFS 795 An Evaluation on Feature Selection for Text Clustering.

Statistical Machine Translation Part II: Word Alignments and EM Alexander Fraser Institute for Natural Language Processing University of Stuttgart

Statistical Machine Translation Part II: Word Alignments and EM Alexander Fraser ICL, U. Heidelberg CIS, LMU München Statistical Machine Translation.

Statistical Machine Translation Part II – Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart

SEARCHING QUESTION AND ANSWER ARCHIVES Dr. Jiwoon Jeon Presented by CHARANYA VENKATESH KUMAR.

Word Sense Disambiguation for Machine Translation Han-Bin Chen

Word Alignment Philipp Koehn USC/Information Sciences Institute USC/Computer Science Department School of Informatics University of Edinburgh Some slides.

Chinese Word Segmentation Method for Domain-Special Machine Translation Su Chen; Zhang Yujie; Guo Zhen; Xu Jin’an Beijing Jiaotong University.

Confidence Estimation for Machine Translation J. Blatz et.al, Coling 04 SSLI MTRG 11/17/2004 Takahiro Shinozaki.

Carnegie Mellon 1 Maximum Likelihood Estimation for Information Thresholding Yi Zhang & Jamie Callan Carnegie Mellon University

Why Generative Models Underperform Surface Heuristics UC Berkeley Natural Language Processing John DeNero, Dan Gillick, James Zhang, and Dan Klein.

Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

1 LM Approaches to Filtering Richard Schwartz, BBN LM/IR ARDA 2002 September 11-12, 2002 UMASS.

Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.

Symmetric Probabilistic Alignment Jae Dong Kim Committee: Jaime G. Carbonell Ralf D. Brown Peter J. Jansen.

ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.

1 The Web as a Parallel Corpus  Parallel corpora are useful  Training data for statistical MT  Lexical correspondences for cross-lingual IR  Early.

MACHINE TRANSLATION AND MT TOOLS: GIZA++ AND MOSES -Nirdesh Chauhan.

Natural Language Processing Expectation Maximization.

Translation Model Parameters (adapted from notes from Philipp Koehn & Mary Hearne) 24 th March 2011 Dr. Declan Groves, CNGL, DCU

An Introduction to SMT Andy Way, DCU. Statistical Machine Translation (SMT) Translation Model Language Model Bilingual and Monolingual Data* Decoder:

English-Persian SMT Reza Saeedi 1 WTLAB Wednesday, May 25, 2011.

Query Rewriting Using Monolingual Statistical Machine Translation Stefan Riezler Yi Liu Google 2010 Association for Computational Linguistics.

Tree Kernels for Parsing: (Collins & Duffy, 2001) Advanced Statistical Methods in NLP Ling 572 February 28, 2012.

Probabilistic Model for Definitional Question Answering Kyoung-Soo Han, Young-In Song, and Hae-Chang Rim Korea University SIGIR 2006.

1 Wikification CSE 6339 (Section 002) Abhijit Tendulkar.

Advanced Signal Processing 05/06 Reinisch Bernhard Statistical Machine Translation Phrase Based Model.

Learning Phonetic Similarity for Matching Named Entity Translation and Mining New Translations Wai Lam, Ruizhang Huang, Pik-Shan Cheung ACM SIGIR 2004.

Active Learning for Statistical Phrase-based Machine Translation Gholamreza Haffari Joint work with: Maxim Roy, Anoop Sarkar Simon Fraser University NAACL.

Eric H. Huang, Richard Socher, Christopher D. Manning, Andrew Y. Ng Computer Science Department, Stanford University, Stanford, CA 94305, USA ImprovingWord.

Machine Translation Course 5 Diana Trandab ă ț Academic year:

Morpho Challenge competition Evaluations and results Authors Mikko Kurimo Sami Virpioja Ville Turunen Krista Lagus.

Retrieval Models for Question and Answer Archives Xiaobing Xue, Jiwoon Jeon, W. Bruce Croft Computer Science Department University of Massachusetts, Google,

Reordering Model Using Syntactic Information of a Source Tree for Statistical Machine Translation Kei Hashimoto, Hirohumi Yamamoto, Hideo Okuma, Eiichiro.

Why Not Grab a Free Lunch? Mining Large Corpora for Parallel Sentences to Improve Translation Modeling Ferhan Ture and Jimmy Lin University of Maryland,

Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.

Bayesian Word Alignment for Statistical Machine Translation Authors: Coskun Mermer, Murat Saraclar Present by Jun Lang I2R SMT-Reading Group.

ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.

Iterative Translation Disambiguation for Cross Language Information Retrieval Christof Monz and Bonnie J. Dorr Institute for Advanced Computer Studies.

Chinese Word Segmentation Adaptation for Statistical Machine Translation Hailong Cao, Masao Utiyama and Eiichiro Sumita Language Translation Group NICT&ATR.

NRC Report Conclusion Tu Zhaopeng NIST06  The Portage System  For Chinese large-track entry, used simple, but carefully- tuned, phrase-based.

An Iterative Approach to Extract Dictionaries from Wikipedia for Under-resourced Languages G. Rohit Bharadwaj Niket Tandon Vasudeva Varma Search and Information.

LREC 2008 Marrakech 29 May Caroline Lavecchia, Kamel Smaïli and David Langlois LORIA / Groupe Parole, Vandoeuvre-Lès-Nancy, France Phrase-Based Machine.

Yajuan Lü, Jin Huang and Qun Liu EMNLP, 2007 Presented by Mei Yang, May 12nd, 2008 Improving SMT Preformance by Training Data Selection and Optimization.

From Text to Image: Generating Visual Query for Image Retrieval Wen-Cheng Lin, Yih-Chen Chang and Hsin-Hsi Chen Department of Computer Science and Information.

Mutual bilingual terminology extraction Le An Ha*, Gabriela Fernandez**, Ruslan Mitkov*, Gloria Corpas*** * University of Wolverhampton ** Universidad.

August 17, 2005Question Answering Passage Retrieval Using Dependency Parsing 1/28 Question Answering Passage Retrieval Using Dependency Parsing Hang Cui.

Multi-level Bootstrapping for Extracting Parallel Sentence from a Quasi-Comparable Corpus Pascale Fung and Percy Cheung Human Language Technology Center,

Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.

1 Minimum Error Rate Training in Statistical Machine Translation Franz Josef Och Information Sciences Institute University of Southern California ACL 2003.

Phrase-Based Statistical Machine Translation as a Traveling Salesman Problem Mikhail Zaslavskiy Marc Dymetman Nicola Cancedda ACL 2009.

NLP. Machine Translation Source-channel model of communication Parametric probabilistic models of language and translation.

Statistical Machine Translation Part II: Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart

Pastra and Saggion, EACL 2003 Colouring Summaries BLEU Katerina Pastra and Horacio Saggion Department of Computer Science, Natural Language Processing.

An evolutionary approach for improving the quality of automatic summaries Constantin Orasan Research Group in Computational Linguistics School of Humanities,

Automatic Question Answering Beyond the Factoid Radu Soricut Information Sciences Institute University of Southern California Eric Brill Microsoft Research.

Review: Review: Translating without in-domain corpus: Machine translation post-editing with online learning techniques Antonio L. Lagarda, Daniel Ortiz-Martínez,

Question Answering Passage Retrieval Using Dependency Relations (SIGIR 2005) (National University of Singapore) Hang Cui, Renxu Sun, Keya Li, Min-Yen Kan,

Statistical Machine Translation Part II: Word Alignments and EM

Neural Machine Translation by Jointly Learning to Align and Translate

Expectation-Maximization Algorithm

Machine Translation and MT tools: Giza++ and Moses

Improved Word Alignments Using the Web as a Corpus

Statistical Machine Translation Papers from COLING 2004

Word Alignment David Kauchak CS159 – Fall 2019 Philipp Koehn

Machine Translation and MT tools: Giza++ and Moses

Presented By: Sparsh Gupta Anmol Popli Hammad Abdullah Ayyubi

Pushpak Bhattacharyya CSE Dept., IIT Bombay 31st Jan, 2011

Neural Machine Translation by Jointly Learning to Align and Translate

Presentation transcript:

Korea Maritime and Ocean University NLP Jung Tae LEE

` 1. Introduction For Domain-specific Statistical Machine Translation setting  High-quality SMT system can only built with large quantities of parallel text  It is good enough to boost the performance of and out-of-domain system trained Proposed Method  Without worrying about the position of sentences in the text  Use a customized metric, which combines different similarity criteria

` SMT(Statistical Machine Translation)?  One of MT(Machine Translation)  It isn’t care about the rule of language  Don’t need manpower 2. What is SMT? But its not enough to parallel text for SMT system Representative services  Google’s translator  BingTranslator

` Biligual Parallel Corpus Target Language Corpus Translation Model Language Model Statistical Alignment & AnalysisStatistical Language Modeling Decoder e1· · ·eI (target) f1· · · fJ (source) Find sentence best score

`

` IBM Model 1 : basic model in SMT  The IBM models are an instance of a noisy-channel model  Phrase-based statistical MT - it use the alignment algorithms just to find the best alignment for sentence pair (F, E), in order to help extract a set of phrases.  The parameters of the IBM models will be estimated using the expectation maximization (EM) algorithm

` EM algorithm (1) Initialization - Assign random(or uniform) probability (2) Maximization Step - Re-create cluster model - Re-compute the parameter (mean, variance) (3) Expectation Step - Update record's weight (4) Stopping criteria - Calculate log-likelihood, If the value saturate, exit, else Goto Step 2

` EM algorithm (1)Initialization green house the house casa verde la casa E = {green, house, the}, S = {casa, verde, la} Start with uniform probabilities :

` EM algorithm green house green house the house the house casa verde casa verde la casa la casa

` EM algorithm green house green house the house the house casa verde casa verde la casa la casa

` Compute the MLE probability parameters by normalizing the tcount sum to 1. total(green) = 1 total(house) = 2 total(the) = 1

` EM algorithm green house green house the house the house casa verde casa verde la casa la casa ※ Note that the two correct alignments are now higher in probability than the two incorrect alignments.

` 3. Introduced method Architecture of parallel sentence Extraction process

` Finding candidate article By interlanguage links Provided by wikipedia itself ->don’t worry about doc align Input queries contains the 100most frequent mountaineering Keywords in the *Text+Berg corpus *Text+Berg corpus : training data corpus ->This for avoid a false positive

` Finding parallel segments in Wikipedia articles The general idea is to split the sentence into clauses containing a single full verb. Similarity score criteria : 1. METEOR scroe - it can appreciate pronouns(like je and j’) 2. Number of alinged content word - it can increase the chance of other candidates with similar length

` Finding parallel segments in Wikipedia articles The distribution of the extracted clause pairs at different thresholds

` Finding parallel segments in Wikipedia articles The average sentence length for different score ranges

` Experiments and Results Extract 225,000 parallel clause pairs. (in 39,000 parallel articles) Perfect translationWith extra segmentmisalignment 39%26%35% Set of 200 automatically aligned clauses with similarity scores above 0.25 For this test set : However, given the high degree of parallelism between the clauses from The middle class, achieving a precision : 65%

` 4. MT Evaluation Settings : - use only pairs with a similarity score above SMT systems are trained with the Moses toolkit - baseline system is the same used for the automatic translations required in the extraction step - translation performance was measured using the BLEU evaluation metric

` BLEU score BLEU 의 기본 전제 아이디어는 기계번역이 전문 번역인에 의한 번역과 비슷할수록 기계번역의 질은 좋을 것이라는 것이다 전문 번역 기계 번역 번역기 비교 *P-value( 유의확률 ) : 귀무가설이 맞을 경우, 표본에서 얻은 표본평균보다 대립가설 쪽의 값이 나올 확률이 얼마나 되는지. 본 논문에서는 0.05~0.06

` If the data is of good quality, it can improve the performance of the system, otherwise it significantly deteriorates it. quantity of the data is not the decisive factor for the performance change, but rather the quality of the data SMT result for German-French Summary table

Korea Maritime and Ocean University NLP Jung Tae LEE