Statistical NLP Spring 2010


1 Statistical NLP Spring 2010
Lecture 18: Phrase / Syntactic MT Dan Klein – UC Berkeley

2 Decoding
First, consider word-to-word models
Finding best alignments is easy
Finding translations is hard (why?)

3 Bag “Generation” (Decoding)

4 Bag Generation as a TSP
Imagine bag generation with a bigram LM
Words are nodes
Edge weights are P(w|w’)
Valid sentences are Hamiltonian paths
Not the best news for word-based MT (and it doesn’t get better with phrases)
[Figure: graph whose nodes are the words "it", "is", "not", "clear", "."]
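To make the TSP view concrete, here is a minimal Python sketch of bag generation by exhaustive search. It is an illustration, not the lecture's method: the function names, the boundary symbols `<s>` and `</s>`, and the toy bigram probabilities are all assumptions. Every ordering of the bag is a Hamiltonian path through the word graph, and the search scores each path with a bigram LM.

```python
import math
from itertools import permutations

def best_ordering(bag, bigram_logprob, start="<s>", end="</s>"):
    """Score every ordering of `bag` with a bigram LM and return the best one."""
    best, best_score = None, -math.inf
    for order in permutations(bag):
        seq = (start, *order, end)
        # Path weight = sum of log P(w | w') over consecutive word pairs.
        score = sum(bigram_logprob(prev, word) for prev, word in zip(seq, seq[1:]))
        if score > best_score:
            best, best_score = list(order), score
    return best, best_score

# Toy bigram probabilities (made up for illustration only).
TOY_BIGRAMS = {
    ("<s>", "it"): 0.5, ("it", "is"): 0.6, ("is", "not"): 0.4,
    ("not", "clear"): 0.5, ("clear", "."): 0.5, (".", "</s>"): 0.9,
}

def toy_logprob(prev, word):
    # Unseen bigrams get a small floor probability instead of zero.
    return math.log(TOY_BIGRAMS.get((prev, word), 1e-4))

order, score = best_ordering(["is", "it", ".", "not", "clear"], toy_logprob)
```

The loop visits n! orderings, which is why this only works for tiny bags; that blow-up is the point of the TSP comparison.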

5 IBM Decoding as a TSP

6 Phrase-Based Systems
[Figure labels: sentence-aligned corpus, word alignments, phrase table (translation model)]
Phrase table (translation model):
cat ||| chat ||| 0.9
the cat ||| le chat ||| 0.8
dog ||| chien ||| 0.8
house ||| maison ||| 0.6
my house ||| ma maison ||| 0.9
language ||| langue ||| 0.9
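The phrase table entries on this slide use a `source ||| target ||| score` layout. As a small sketch of how such a table might be loaded for lookup, the Python below parses that layout into a dictionary keyed by the first field. The function name and the choice to key on the first field are assumptions; the slide does not say which column is the source language.

```python
from collections import defaultdict

PHRASE_TABLE_TEXT = """\
cat ||| chat ||| 0.9
the cat ||| le chat ||| 0.8
dog ||| chien ||| 0.8
house ||| maison ||| 0.6
my house ||| ma maison ||| 0.9
language ||| langue ||| 0.9
"""

def load_phrase_table(text):
    """Map each first-field phrase (tuple of words) to its (translation, score) options."""
    table = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        left, right, score = (field.strip() for field in line.split("|||"))
        table[tuple(left.split())].append((tuple(right.split()), float(score)))
    return table

table = load_phrase_table(PHRASE_TABLE_TEXT)
```

A decoder would query this with spans of the input sentence, for example `table[("the", "cat")]`.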

7 The Pharaoh “Model” [Koehn et al, 2003]
Segmentation
Translation
Distortion

8 The Pharaoh “Model” Where do we get these counts?
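One common answer to the question of where the counts come from, sketched here under stated assumptions rather than taken from the slide: extract every phrase pair that is consistent with the word alignment, then estimate translation probabilities by relative frequency. The function names, the `max_len` limit, and the toy corpus are hypothetical.

```python
from collections import Counter

def extract_phrases(n_src, alignment, max_len=3):
    """Yield ((s1, s2), (t1, t2)) half-open spans consistent with `alignment`.

    `alignment` is a set of (source_index, target_index) links. A pair is
    consistent when no link connects a word inside one span to a word
    outside the other. Only the tightest target span is produced; extending
    spans over unaligned words is omitted for brevity.
    """
    for s1 in range(n_src):
        for s2 in range(s1 + 1, min(s1 + max_len, n_src) + 1):
            linked = [t for s, t in alignment if s1 <= s < s2]
            if not linked:
                continue
            t1, t2 = min(linked), max(linked) + 1
            if t2 - t1 > max_len:
                continue
            # Reject if a target word in the span links outside the source span.
            if any(t1 <= t < t2 and not (s1 <= s < s2) for s, t in alignment):
                continue
            yield (s1, s2), (t1, t2)

def relative_frequencies(corpus):
    """Estimate P(target phrase | source phrase) = count(src, tgt) / count(src)."""
    pair_counts, src_counts = Counter(), Counter()
    for src, tgt, alignment in corpus:
        for (s1, s2), (t1, t2) in extract_phrases(len(src), alignment):
            s, t = tuple(src[s1:s2]), tuple(tgt[t1:t2])
            pair_counts[s, t] += 1
            src_counts[s] += 1
    return {pair: c / src_counts[pair[0]] for pair, c in pair_counts.items()}

# Toy word-aligned corpus (illustrative only).
CORPUS = [
    (["the", "cat"], ["le", "chat"], {(0, 0), (1, 1)}),
    (["the", "house"], ["la", "maison"], {(0, 0), (1, 1)}),
]
probs = relative_frequencies(CORPUS)
```

Because the counts come from heuristic extraction rather than a trained generative model, the resulting scores are not a proper probability model over segmentations, which is one reading of the quotation marks around “Model”.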

9

10 Phrase-Based Decoding
这 7人 中包括 来自 法国 和 俄罗斯 的 宇航 员
Decoder design is important: [Koehn et al. 03]

11 The Pharaoh Decoder Probabilities at each step include LM and TM

12 Hypothesis Lattices

13 Pruning
Problem: easy partial analyses are cheaper
Solution 1: use beams per foreign subset
Solution 2: estimate forward costs (A*-like)
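The two pruning strategies can be sketched as follows. This is a minimal Python illustration, not the Pharaoh implementation: the `Hyp` record, both function names, and the `future_cost` callback are assumptions introduced here. Scores are log probabilities, so higher is better.

```python
import heapq
from collections import defaultdict
from typing import Callable, FrozenSet, List, NamedTuple, Tuple

class Hyp(NamedTuple):
    coverage: FrozenSet[int]   # foreign positions translated so far
    logprob: float             # LM + TM score of the partial translation
    output: Tuple[str, ...]    # target words produced so far

def prune_per_subset(hyps: List[Hyp], beam_size: int) -> List[Hyp]:
    """Solution 1: keep the best `beam_size` hypotheses per covered foreign subset."""
    stacks = defaultdict(list)
    for hyp in hyps:
        stacks[hyp.coverage].append(hyp)
    kept = []
    for stack in stacks.values():
        # Hypotheses only compete with others that covered the same words.
        kept.extend(heapq.nlargest(beam_size, stack, key=lambda h: h.logprob))
    return kept

def prune_with_future_cost(hyps: List[Hyp], beam_size: int,
                           future_cost: Callable[[FrozenSet[int]], float]) -> List[Hyp]:
    """Solution 2: one beam ranked by score so far plus an estimate for the rest."""
    return heapq.nlargest(
        beam_size, hyps, key=lambda h: h.logprob + future_cost(h.coverage))

hyps = [
    Hyp(frozenset({0}), -1.0, ("a",)),
    Hyp(frozenset({0}), -2.0, ("b",)),
    Hyp(frozenset({1}), -5.0, ("c",)),
]
kept_subset = prune_per_subset(hyps, beam_size=1)
# Toy estimate: pretend the words left after covering position 0 are hard.
kept_future = prune_with_future_cost(
    hyps, beam_size=1, future_cost=lambda cov: -10.0 if 0 in cov else 0.0)
```

In the toy data, the hypothesis that translated the easy word first looks best on its raw score, but the future-cost estimate reverses the ranking, which is the problem the slide describes.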

14 Phrase Scoring
Learning weights has been tried, several times:
[Marcu and Wong, 02]
[DeNero et al, 06]
… and others
Seems not to work well, for a variety of partially understood reasons
Main issue: big chunks get all the weight, obvious priors don’t help
Though, [DeNero et al 08]
[Figure: word alignment between "les chats aiment le poisson frais ." and "cats like fresh fish ."]

15 Extraction Sets GIZA BITG ExSets [DeNero and Klein, in submission]

16 Phrase Size
Phrases do help
But they don’t need to be long
Why should this be?

17 Lexical Weighting

18 WSD?
Remember when we discussed WSD?
Word-based MT systems rarely have a WSD step
Why not?

19 Syntax-Based MT

20 Translation by Parsing

21 Translation by Parsing

22 Compact Forests

23 Compact Forests

24 Compact Forests

25 Compact Forests

26 Learning MT Grammars

27 Extracting syntactic rules
Extract rules (Galley et al. ’04, ’06)

28 Rules can...
capture phrasal translation
reorder parts of the tree
traverse the tree without reordering
insert (and delete) words

29 Bad alignments make bad rules
This isn’t very good, but let’s look at a worse example...

30 Sometimes they’re really bad
One bad link makes a totally unusable rule!

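31 Discriminative Block ITG
Features: φ(b0, s, s’), φ(b1, s, s’), φ(b2, s, s’)
[Figure: block alignment with blocks b0, b1, b2 over the English words "recent years", "country", "entire", "in the", "warn", "to" and the Chinese words 近年, 提醒]
[Haghighi, Blitzer, DeNero, and Klein, ACL 09]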

32 Syntactic Correspondence
Build a model 中文 EN

33 Synchronous Grammars? Now how do we talk about a model which has 2 corresponding trees? Synchronicity

34 Synchronous Grammars?

35 Synchronous Grammars?

36 Adding Syntax: Weak Synchronization
Block ITG Alignment

37 Adding Syntax: Weak Synchronization
Separate PCFGs

38 Adding Syntax: Weak Synchronization
Get points for synchronization; not required

39 Weakly Synchronous Features
Parsing: (IP, s), (NP, s), (VP, s), (S, s’), (NP, s’), (AP, s’), (VP, s’)
Alignment: (b0, s, s’), (b1, s, s’), (b2, s, s’)
Agreement: (IP, b0), (b0, S), (b1, NP), (IP, b0, S)
[Figure: two trees with nodes IP, NP, VP and NP, AP, VP, linked by blocks b0, b1, b2]

40 Weakly Synchronous Model
Feature Type 1: Word Alignment
Feature Type 2: Monolingual Parser
Feature Type 3: Agreement
[Figure labels: EN, 中文; 办公室, office; PP "in the office"]
[HBDK09]
Our model can do more. We can handle non-synchronicity.

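41 Inference: Structured Mean Field
Problem: Summing over weakly aligned hypotheses is intractable
Factored approximation: [equation not preserved in transcript]
Set to minimize: [equation not preserved in transcript]
Algorithm
Initialize: [not preserved in transcript]
Iterate: [not preserved in transcript]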

42 Results [Burkett, Blitzer, and Klein, NAACL 10]

43 Incorrect English PP Attachment

44 Corrected English PP Attachment

45 Improved Translations
Source: 目前 导致 飞机 相撞 原因 清楚, 当地 民航 部门 对此 展开 调查
Pinyin: mu4qian1 dao3zhi4 fei1ji1 xiang1zhuang4 DE yuan2yin1 shang4 bu4 qing1chu3, dang1di4 min2hang2 bu4men2 jiang1 dui4ci3 zhan3kai1 diao4cha2
Gloss: Currently cause plane crash DE reason yet not clear, local civil aeronautics bureau will toward open investigations
Reference: At this point the cause of the plane collision is still unclear. The local caa will launch an investigation into this .
Baseline (GIZA++): The cause of planes is still not clear yet, local civil aviation department will investigate this .
Bilingual Adaptation Model: The cause of plane collision remained unclear, local civil aviation departments will launch an investigation .

