A Word-Class Approach to Labeling PSCFG Rules for Machine Translation (ACL 2011) Andreas Zollmann and Stephan Vogel Presented by Yun Huang 01/07/2011.

Slides:



Advertisements
Similar presentations
School of something FACULTY OF OTHER School of Computing FACULTY OF ENGINEERING Chunking: Shallow Parsing Eric Atwell, Language Research Group.
Advertisements

MIXED SYNTAX TRANSLATION Hieu Hoang MT Marathon 2011 Trento.
Prolog programming....Dr.Yasser Nada. Chapter 8 Parsing in Prolog Taif University Fall 2010 Dr. Yasser Ahmed nada prolog programming....Dr.Yasser Nada.
Cluster Computing for Statistical Machine Translation Qin Gao, Kevin Gimpel, Alok Palikar, Andreas Zollmann Stephan Vogel, Noah Smith.
Document Summarization using Conditional Random Fields Dou Shen, Jian-Tao Sun, Hua Li, Qiang Yang, Zheng Chen IJCAI 2007 Hao-Chin Chang Department of Computer.
Self-training with Products of Latent Variable Grammars Zhongqiang Huang, Mary Harper, and Slav Petrov.
 Christel Kemke 2007/08 COMP 4060 Natural Language Processing Feature Structures and Unification.
Introduction to Syntax, with Part-of-Speech Tagging Owen Rambow September 17 & 19.
Sequence Classification: Chunking Shallow Processing Techniques for NLP Ling570 November 28, 2011.
Grammar Development Platform Miriam Butt October 2002.
Natural Language Processing Projects Heshaam Feili
Part of Speech Tagging Importance Resolving ambiguities by assigning lower probabilities to words that don’t fit Applying to language grammatical rules.
For Monday Read Chapter 23, sections 3-4 Homework –Chapter 23, exercises 1, 6, 14, 19 –Do them in order. Do NOT read ahead.
NLP and Speech Course Review. Morphological Analyzer Lexicon Part-of-Speech (POS) Tagging Grammar Rules Parser thethe – determiner Det NP → Det.
The current status of Chinese- English EBMT -where are we now Joy (Ying Zhang) Ralf Brown, Robert Frederking, Erik Peterson Aug 2001.
1 A Tree Sequence Alignment- based Tree-to-Tree Translation Model Authors: Min Zhang, Hongfei Jiang, Aiti Aw, et al. Reporter: 江欣倩 Professor: 陳嘉平.
A Non-Parametric Bayesian Approach to Inflectional Morphology Jason Eisner Johns Hopkins University This is joint work with Markus Dreyer. Most of the.
1 CSC 594 Topics in AI – Applied Natural Language Processing Fall 2009/ Shallow Parsing.
Matakuliah: G0922/Introduction to Linguistics Tahun: 2008 Session 11 Syntax 2.
Learning Accurate, Compact, and Interpretable Tree Annotation Slav Petrov, Leon Barrett, Romain Thibaux, Dan Klein.
Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.
The current status of Chinese-English EBMT research -where are we now Joy, Ralf Brown, Robert Frederking, Erik Peterson Aug 2001.
“Applying Morphology Generation Models to Machine Translation” By Kristina Toutanova, Hisami Suzuki, Achim Ruopp (Microsoft Research). UW Machine Translation.
TIDES MT Workshop Review. Using Syntax?  ISI-small: –Cross-lingual parsing/decoding Input: Chinese sentence + English lattice built with all possible.
Course Summary LING 575 Fei Xia 03/06/07. Outline Introduction to MT: 1 Major approaches –SMT: 3 –Transfer-based MT: 2 –Hybrid systems: 2 Other topics.
Machine Learning in Natural Language Processing Noriko Tomuro November 16, 2006.
Unification-based algorithms We have seen that treating features, such as person-number agreement, as part of the category of a phrase results in a large.
Semi-Automatic Learning of Transfer Rules for Machine Translation of Low-Density Languages Katharina Probst April 5, 2002.
A Hierarchical Phrase-Based Model for Statistical Machine Translation Author: David Chiang Presented by Achim Ruopp Formulas/illustrations/numbers extracted.
Probabilistic Parsing Ling 571 Fei Xia Week 5: 10/25-10/27/05.
Does Syntactic Knowledge help English- Hindi SMT ? Avinesh. PVS. K. Taraka Rama, Karthik Gali.
SI485i : NLP Set 9 Advanced PCFGs Some slides from Chris Manning.
Czech-to-English Translation: MT Marathon 2009 Session Preview Jonathan Clark Greg Hanneman Language Technologies Institute Carnegie Mellon University.
PFA Node Alignment Algorithm Consider the parse trees of a Chinese-English parallel pair of sentences.
For Friday Finish chapter 23 Homework: –Chapter 22, exercise 9.
Recent Major MT Developments at CMU Briefing for Joe Olive February 5, 2008 Alon Lavie and Stephan Vogel Language Technologies Institute Carnegie Mellon.
The ICT Statistical Machine Translation Systems for IWSLT 2007 Zhongjun He, Haitao Mi, Yang Liu, Devi Xiong, Weihua Luo, Yun Huang, Zhixiang Ren, Yajuan.
NUDT Machine Translation System for IWSLT2007 Presenter: Boxing Chen Authors: Wen-Han Chao & Zhou-Jun Li National University of Defense Technology, China.
Reordering Model Using Syntactic Information of a Source Tree for Statistical Machine Translation Kei Hashimoto, Hirohumi Yamamoto, Hideo Okuma, Eiichiro.
Advanced MT Seminar Spring 2008 Instructors: Alon Lavie and Stephan Vogel.
Transfer-based MT with Strong Decoding for a Miserly Data Scenario Alon Lavie Language Technologies Institute Carnegie Mellon University Joint work with:
Publications Vamshi Ambati, Stephan Vogel and Jaime Carbonell. "Collaborative Workflow for Crowdsourcing Translation”, To Appear in the 2012 ACM Conference.
A Systematic Exploration of the Feature Space for Relation Extraction Jing Jiang & ChengXiang Zhai Department of Computer Science University of Illinois,
What’s in a translation rule? Paper by Galley, Hopkins, Knight & Marcu Presentation By: Behrang Mohit.
CPE 480 Natural Language Processing Lecture 4: Syntax Adapted from Owen Rambow’s slides for CSc Fall 2006.
Intelligent Database Systems Lab Presenter : Kung, Chien-Hao Authors : Yoong Keok Lee and Hwee Tou Ng 2002,EMNLP An Empirical Evaluation of Knowledge Sources.
INSTITUTE OF COMPUTING TECHNOLOGY Forest-to-String Statistical Translation Rules Yang Liu, Qun Liu, and Shouxun Lin Institute of Computing Technology Chinese.
CSA2050 Introduction to Computational Linguistics Parsing I.
CPSC 422, Lecture 27Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 27 Nov, 16, 2015.
Support Vector Machines and Kernel Methods for Co-Reference Resolution 2007 Summer Workshop on Human Language Technology Center for Language and Speech.
11 Project, Part 3. Outline Basics of supervised learning using Naïve Bayes (using a simpler example) Features for the project 2.
CS 4705 Lecture 17 Semantic Analysis: Robust Semantics.
NLP. Introduction to NLP Time flies like an arrow –Many parses –Some (clearly) more likely than others –Need for a probabilistic ranking method.
Discriminative Modeling extraction Sets for Machine Translation Author John DeNero and Dan KleinUC Berkeley Presenter Justin Chiu.
October 10, 2003BLTS Kickoff Meeting1 Transfer with Strong Decoding Learning Module Transfer Rules {PP,4894} ;;Score: PP::PP [NP POSTP] -> [PREP.
A Syntax-Driven Bracketing Model for Phrase-Based Translation Deyi Xiong, et al. ACL 2009.
CMU MilliRADD Small-MT Report TIDES PI Meeting 2002 The CMU MilliRADD Team: Jaime Carbonell, Lori Levin, Ralf Brown, Stephan Vogel, Alon Lavie, Kathrin.
Prototype-Driven Grammar Induction Aria Haghighi and Dan Klein Computer Science Division University of California Berkeley.
1 Local Search for Optimal Permutations Jason Eisner and Roy Tromble with Very Large-Scale Neighborhoods in Machine Translation.
LING 575 Lecture 5 Kristina Toutanova MSR & UW April 27, 2010 With materials borrowed from Philip Koehn, Chris Quirk, David Chiang, Dekai Wu, Aria Haghighi.
Question Classification Ling573 NLP Systems and Applications April 25, 2013.
Natural Language Processing Vasile Rus
Syntax-based Statistical Machine Translation Models
Suggestions for Class Projects
Machine Learning in Natural Language Processing
Vamshi Ambati 14 Sept 2007 Student Research Symposium
Johns Hopkins 2003 Summer Workshop on Syntax and Statistical Machine Translation Chapters 5-8 Ethan Phelps-Goodman.
David Kauchak CS159 – Spring 2019
Artificial Intelligence 2004 Speech & Natural Language Processing
Rule Markov Models for Fast Tree-to-String Translation
Presentation transcript:

A Word-Class Approach to Labeling PSCFG Rules for Machine Translation (ACL 2011) Andreas Zollmann and Stephan Vogel Presented by Yun Huang 01/07/2011

2 Background PSCFG (Chiang 2005,2007) –Rules: X => (γ/α/ w) X=>( / held talk with Sharon) X=>( X 1 / held talk with X 1 ) X=>( X 1 X 2 / held X 2 with X 1 ) –Glue rules: S=>(X / X) S=>(S X / S X) –Decoding: cube-pruning, etc.

3 Motivation Only S and X are used in PSCFG, can not model different rule categories. Example: –X=>( X 1 X 2 / held X 2 with X 1 ) –No difference between X 1 and X 2 Maybe we want … –VP=>( PRP NP / held NP with PRP) Idea: multi-label PSCFG. How to label hierarchical phrases?

4 Simple: boundary (POS) tags –I[PRP] saw[VBD] him[PRP] Extracted rules: –PRP-PRP => (ich / I) –PRP-PRP => (ihn / him) –VBD-VBD => (gesehen / saw) –VBD-PRP => (habe ihn gesehen / saw him) –VBD-PRP => (Ich habe ihn gesehen / I saw him) –VBD-PRP => (habe PRP-PRP gesehen / saw PRP- PRP Labeling from word classes(1/4)

5 Labeling from word classes(2/4) Accounting for phrase size –1-word PRP=>(Ich | I) PRP=>(ihn | him) –2-word VBD-PRP => (habe ihn gesehen / saw him) VBD-PRP => (habe PRP gesehen / saw PRP) –multiple-word VBD..PRP => (Ich habe ihn gesehen / I saw him) VBD..PRP => (Ich habe PRP gesehen / I saw PRP)

6 Labeling from word classes(3/4) Bilingually tagged corpus –Ich[PRP] habe[AUX] ihn[PRP] gesehen[VBN] –I[PRP] saw[VBD] him[PRP] Extracted rules: (src label+tgt label) –PRP+PRP => (ich / I) –PRP+PRP => (ihn / him) –VBN+VBD => (gesehen / saw) –AUX..VBN+VBD-PRP => (habe ihn gesehen / saw him) –PRP..VBN+PRP..PRP => (Ich habe ihn gesehen / I saw him) –AUX..VBN+VBD-PRP => (habe PRP+PRP gesehen / saw PRP+PRP

7 Labeling from word classes(4/4) Unsupervised word class clustering –MKCLS –Morphological information Problems of word classes: –Huge grammar size –Data sparseness –Solution: directly clustering rules

8 Clustering phrase pairs Directly clustering phrase pairs K-means clustering algorithm

9 Experiments Baseline PTB POS Tags Word Class Clustering Phrase Clustering

10 Experiments

11 Related Work JHU workshop 2010 – ps/msgismt/ ps/msgismt/ Other approaches –Phrase clustering –Syntax-augmented MT Source code: –SAMT system

12 Problems Too simple, sometimes naïve. –Simple features –Simple clustering method –How to control model complexity Future work –Other learning method instead of clustering –Combining hierarchical phrase based model with syntactical trees