Download presentation
Presentation is loading. Please wait.
1
Introduction to IBM Model 1&2 Alignment
ι±ηη₯Ί
2
machine translation model
French π ο English π π π π = π π,π π π = π π π π π π π π π π π π β = arg max π π π π = arg max π π π π π π π π the language model π π π the translation model (difficult)
3
why alignment e (π words) = And the program has been implemented
f (π words)= Le programme a ete mis en application π π π,π is difficult with the help of alignment: π π π,π = πβπ π π,π π,π
4
alignment e (π words) = And the program has been implemented
f (π words)= Le programme a ete mis en application alignment = {2,3,4,5,6,6,6} # of all possible alignments = 1 π+1 π
5
alignment e (π words) = And the program has been implemented
f (π words) = __ _______ __ __ ___ ___ ______ programme π π π,π = πβπ π π,π π,π π π,π π,π =π π π,π π π π,π,π π π π,π = πβπ π π π,π π π π,π,π
6
most likely alignment π π,π π,π =π π π,π π π π,π,π
π π π,π = πβπ π π π,π π π π,π,π π π π,π,π = π π,π π,π πβπ π,π π,π = π π,π π,π π π π,π π β = arg max π π π π,π,π
7
alignment example French English Alignment
le conseil a rendu son avis, et nous devons à présent adopter un nouvel avis sur la base de la première position. English the council has stated its position, and now, on the basis of the first position, we again have to give our opinion. Alignment the/le council/conseil has/à stated/rendu its/son position/avis ,/, and/et now/présent ,/, on/sur the/le basis/base of/de the/la first/première position/position ,/NULL we/nous again/NULL have/devons to/a give/adopter our/nouvel opinion/avis ./.
8
IBM Model 1 e (π words) = And the program has been implemented
f (π words) = __ _______ __ __ ___ ___ ______ π π,π π,π =π π π,π π π π,π,π all alignments are equally likely: π π π,π = 1 π+1 π there are π+1 π possible values for π
9
IBM Model 1 e (π words) = And the program has been implemented
f (π words) = __ _______ __ __ ___ ___ ______ la programme a ete mis en application π π,π π,π =π π π,π π π π,π,π all alignments are equally likely: π π π,π,π = π=1 π π‘ π π π π π π(each aligned term translation)
10
IBM Model 1 estimate π‘ π π train from parallel corpus
11
IBM Model 2 probability that π'th French word ( π π ) align to π'th English word ( π π ), given length π and π π π π,π,π π π π,π = π=1 π π π π π,π,π β 1 π+1 π π π,π π,π =π π π,π π π π,π,π = π=1 π π π π π,π,π π‘ π π π π π
12
IBM Model 2 e (π words) = And the program has been implemented
f (π words) = __ _______ __ __ ___ ___ ______ programme π π π,π =π π π,7 =π 2|1,6,7 βπ 3|2,6,7 βπ 4|3,6, βπ 5|4,6,7 βπ 6|5,6,7 βπ 6|6,6, βπ 6|7,6,7 π π,π π,π = π=1 π π π π π,π,π π‘ π π π π π
13
IBM Model 2 e (π words) = (NULL) And the program has been implemented
f (π words)= Le programme a ete mis en application π π = arg max πβ 0β¦π π π π,π,π βπ‘( π π | π π ) NULL: π 0 3,6,7 βπ‘(π|πππΏπΏ) And: π 1 3,6,7 βπ‘(π|π΄ππ) the: π 2 3,6,7 βπ‘(π|π‘βπ) program: π 3 3,6,7 βπ‘(π|πππππππ) has: π 4 3,6,7 βπ‘(π|βππ ) been: π 5 3,6,7 βπ‘(π|ππππ) implemented: π 6 3,6,7 βπ‘(π|πππππππππ‘ππ)
14
EM for IBM Model 2 input output: challenge: a set of bitext π π , π π
π = And the program has been implemented π = Le programme a ete mis en application output: π‘ π π & π π π,π,π challenge: do not have alignments on training data
15
EM for IBM Model 2 if alignments are observed
estimate π‘ π π & π π π,π,π by counting π‘ ππΏ π π = πΆππ’ππ‘ π,π πΆππ’ππ‘ π π ππΏ π π,π,π = πΆππ’ππ‘ π π,π,π πΆππ’ππ‘ π,π,π
16
EM for IBM Model 2 For π =1β¦π Set all counts π β¦ =0 For π=1β¦π For π=1β¦ π π , For π=0β¦ π π π π π π , π π π βπ π π π , π π π +πΏ π, π, π π π π π βπ π π π +πΏ π, π, π π π π, π, π βπ π π, π, π +πΏ π, π, π π π, π, π βπ π, π, π +πΏ π, π, π Recalculate the parameters: π‘ π π = π π, π π π π π π, π, π = π π π, π, π π π, π, π
17
EM for IBM Model 2 e (π words) = (NULL) And the program has been implemented f (π words)= Le programme a ete mis en application if alignments are observed πΏ π,π,π =1 ππ π π π =π do not have alignments on training data πΏ π,π,π = π π π, π π , π π π‘ π π π π π π π=0 π π π π π, π π , π π π‘ π π π π π π πΏ π,π,π =π π π π =π π π , π π ;π‘,π
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.