Download presentation
Presentation is loading. Please wait.
1
Statistical NLP Spring 2010
Lecture 18: Phrase / Syntactic MT Dan Klein – UC Berkeley
2
Decoding First, consider word-to-word models
Finding best alignments is easy Finding translations is hard (why?)
3
Bag “Generation” (Decoding)
4
Bag Generation as a TSP Imagine bag generation with a bigram LM
Words are nodes Edge weights are P(w|w’) Valid sentences are Hamiltonian paths Not the best news for word-based MT (and it doesn’t get better with phrases) is it . not clear
5
IBM Decoding as a TSP
6
Phrase-Based Systems Phrase table (translation model) Sentence-aligned
corpus cat ||| chat ||| 0.9 the cat ||| le chat ||| 0.8 dog ||| chien ||| 0.8 house ||| maison ||| 0.6 my house ||| ma maison ||| 0.9 language ||| langue ||| 0.9 … Phrase table (translation model) Word alignments
7
The Pharaoh “Model” [Koehn et al, 2003] Segmentation Translation
Distortion
8
The Pharaoh “Model” Where do we get these counts?
10
Phrase-Based Decoding
这 7人 中包括 来自 法国 和 俄罗斯 的 宇航 员 Decoder design is important: [Koehn et al. 03]
11
The Pharaoh Decoder Probabilities at each step include LM and TM
12
Hypotheis Lattices
13
Pruning Problem: easy partial analyses are cheaper
Solution 1: use beams per foreign subset Solution 2: estimate forward costs (A*-like)
14
Phrase Scoring Learning weights has been tried, several times:
[Marcu and Wong, 02] [DeNero et al, 06] … and others Seems not to work well, for a variety of partially understood reasons Main issue: big chunks get all the weight, obvious priors don’t help Though, [DeNero et al 08] les chats aiment le poisson cats like fresh fish . frais
15
Extraction Sets GIZA BITG ExSets [DeNero and Klein, in submission]
16
Phrase Size Phrases do help But they don’t need to be long
Why should this be?
17
Lexical Weighting
18
WSD? Remember when we discussed WSD?
Word-based MT systems rarely have a WSD step Why not?
19
Syntax-Based MT
20
Translation by Parsing
21
Translation by Parsing
22
Compact Forests
23
Compact Forests
24
Compact Forests
25
Compact Forests
26
Learning MT Grammars
27
Extracting syntactic rules
Extract rules (Galley et. al. ’04, ‘06)
28
Rules can... capture phrasal translation reorder parts of the tree
traverse the tree without reordering insert (and delete) words
29
Bad alignments make bad rules
This isn’t very good, but let’s look at a worse example...
30
Sometimes they’re really bad
One bad link makes a totally unusable rule!
31
Discriminative Block ITG
Features φ( b0, s, s’ ) φ( b1, s, s’ ) φ( b2, s, s’ ) b0 b1 b2 recent years country entire in the warn to 近年 全 提醒 来 国 [Haghighi, Blitzer, Denero, and Klein, ACL 09]
32
Syntactic Correspondence
Build a model 中文 EN
33
Synchronous Grammars? Now how do we talk about a model which has 2 corresponding trees? Synchronicity
34
Synchronous Grammars?
35
Synchronous Grammars?
36
Adding Syntax: Weak Synchronization
Block ITG Alignment
37
Adding Syntax: Weak Synchronization
Separate PCFGs
38
Adding Syntax: Weak Synchronization
Get points for synchronization; not required
39
Weakly Synchronous Features
Parsing Alignment (IP, s) (b0, s, s’) (NP, s) (b1, s, s’) (VP, s) (b2, s, s’) (S, s’) (IP, b0) (NP, s’) (b0, S) (AP, s’) (b1, NP) (VP, s’) (IP, b0, S) NP AP VP b0 b1 NP Agreement IP b2 VP
40
Weakly Synchronous Model
EN EN 中文 中文 EN 中文 EN EN 中文 中文 Feature Type 1: Word Alignment EN 中文 Feature Type 3: Agreement 办公室 office PP [HBDK09] Our model can do more. We can handle non-synchronicity. Feature Type 2: Monolingual Parser EN PP in the office
41
Inference: Structured Mean Field
Problem: Summing over weakly aligned hypotheses is intractable Factored approximation: EN 中文 Set to minimize EN 中文 Algorithm Initialize: Iterate: PP PP
42
Results [Burkett, Blitzer, and Klein, NAACL 10]
43
Incorrect English PP Attachment
44
Corrected English PP Attachment
45
Improved Translations
目前 导致 飞机 相撞 的 原因 尚 不 清楚, 当地 民航 部门 将 对此 展开 调查 Cur-rently cause plane crash DE reason yet not clear, local civil aero-nautics bureau will toward open investi-gations Reference At this point the cause of the plane collision is still unclear. The local caa will launch an investigation into this . Baseline (GIZA++) The cause of planes is still not clear yet, local civil aviation department will investigate this . mu4qian1 dao3zhi4 fei1ji1 xiang1zhuang4 DE yuan2yin1 shang4 bu4 qing1chu3, dang1di4 min2hang2 bu4men2 jiang1 dui4ci3 zhan3kai1 diao4cha2 Bilingual Adaptation Model The cause of plane collision remained unclear, local civil aviation departments will launch an investigation .
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.