CS224N Section 2: PA2 & EM
Shrey Gupta
January 21, 2011
Outline for today
- Interactive session!
- Brief review of MT
- Examples
- Brief EM review
Statistical Machine Translation
P(e|f) = P(f|e) * P(e) / P(f)
argmax_e P(e|f) = argmax_e P(f|e) * P(e), since P(f) is constant with respect to e and can be dropped
Language models (P(e)) help alleviate shortcomings of the translation model P(f|e)
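As a toy illustration of this argmax (the candidate list and probability tables below are made-up stand-ins for a real translation model and language model), a minimal Python sketch:

```python
import math

# Made-up log-probability tables, for illustration only.
def tm_logprob(f, e):
    # Translation model log P(f|e): here indifferent to English word order.
    table = {("la maison", "the house"): math.log(0.9),
             ("la maison", "house the"): math.log(0.9)}
    return table.get((f, e), math.log(1e-6))

def lm_logprob(e):
    # Language model log P(e): prefers grammatical English.
    table = {"the house": math.log(0.1), "house the": math.log(0.001)}
    return table.get(e, math.log(1e-9))

def decode(f, candidates):
    # argmax_e P(f|e) * P(e), computed in log space to avoid underflow.
    return max(candidates, key=lambda e: tm_logprob(f, e) + lm_logprob(e))

print(decode("la maison", ["the house", "house the"]))  # -> the house
```

Here the translation model alone cannot choose between the two word orders; the language model breaks the tie in favor of the grammatical one.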
Concepts
- Translation probabilities (t)
- Distortion probabilities (d)
- Fertility (φ)
- NULL (a special empty word, so that words with no real counterpart can still be aligned)
PA2 Requirements
- Naïve model
- IBM Model 1
- IBM Model 2
- Integration with the decoder
IBM Model 1
- Simplest of the IBM models
- Does not consider word order (bag-of-words approach)
- Does not model one-to-many alignments
- Computationally inexpensive
- Useful for parameter estimates that are passed on to more elaborate models
IBM Model 1
We learn only the translation probabilities t(f|e).
IBM Model 1 Steps
1. Initialize the translation probabilities uniformly.
2. E-step: for each sentence pair, collect fractional counts for every (f, e) word pair, weighting each possible alignment by its probability under the current parameters.
3. M-step: re-estimate t(f|e) by normalizing the fractional counts.
4. Repeat until convergence.
Let's do an example.
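As a concrete example, here is a minimal Python sketch of Model 1 EM on a toy parallel corpus (the sentence pairs, NULL handling, and uniform initialization are illustrative choices, not PA2 starter code):

```python
from collections import defaultdict

def train_ibm1(pairs, iterations=10):
    """Train IBM Model 1 translation probabilities t(f|e) with EM.

    pairs: list of (french_tokens, english_tokens); each English side
    includes a NULL token so French words can align to nothing.
    """
    # Initialize t(f|e) uniformly over the French vocabulary.
    f_vocab = {f for fs, _ in pairs for f in fs}
    t = defaultdict(lambda: 1.0 / len(f_vocab))

    for _ in range(iterations):
        count = defaultdict(float)   # expected count of each (f, e) pair
        total = defaultdict(float)   # expected count of each e

        # E-step: distribute each French word's count over the English words.
        for fs, es in pairs:
            for f in fs:
                z = sum(t[(f, e)] for e in es)   # normalizer for this f
                for e in es:
                    c = t[(f, e)] / z
                    count[(f, e)] += c
                    total[e] += c

        # M-step: re-normalize fractional counts into probabilities.
        for (f, e), c in count.items():
            t[(f, e)] = c / total[e]
    return t

pairs = [("la maison".split(), "NULL the house".split()),
         ("la fleur".split(), "NULL the flower".split())]
t = train_ibm1(pairs)
print(t[("maison", "house")])   # climbs toward 1 as EM disambiguates "la"/"the"
```

Because "la" co-occurs with "the" in both pairs, EM gradually explains "la" with "the", which forces "maison" onto "house" and "fleur" onto "flower".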
IBM Model 2
In Model 2 we learn the translation probabilities and also the distortion probabilities.
IBM Model 2
IBM Model 2 tries to learn the alignment probabilities in addition to the translation probabilities. The alignment probabilities are handled at an abstract level, by grouping alignment pairs into buckets. Let the number of buckets be N (indexed 0 to N-1). For an alignment pair of positions (i, j), let n be the position distance (e.g. n = |i - j|); the pair is placed in bucket n if n < N-1, or in the last bucket (N-1) otherwise.
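A minimal sketch of the bucketing, assuming the distance measure is the absolute position difference |i - j|:

```python
def bucket(i, j, num_buckets):
    """Map an alignment of French position i to English position j to a
    distortion bucket: bucket |i - j|, capped at the last bucket."""
    return min(abs(i - j), num_buckets - 1)

# With N = 5 buckets: nearby positions get their own bucket, distant ones share.
print(bucket(0, 0, 5))  # distance 0 -> bucket 0 (monotone alignment)
print(bucket(1, 3, 5))  # distance 2 -> bucket 2
print(bucket(0, 9, 5))  # distance 9 -> capped at bucket 4
```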
IBM Model 2
In Model 2, during the E-step we also collect fractional counts for each bucket, and in the M-step we normalize them to obtain a true probability distribution (see the sketch below). Many possible implementations:
- Variable number of buckets
- Signed buckets
- Hand-fixed weights
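A sketch of one Model 2 EM iteration under the same assumptions (absolute-distance buckets and a single global distortion distribution d; this is one possible design among the variants listed above, not the required one):

```python
from collections import defaultdict

def bucket(i, j, num_buckets):
    # Same bucketing as the previous sketch.
    return min(abs(i - j), num_buckets - 1)

def em_step_model2(pairs, t, d, num_buckets):
    """One EM iteration: fractional counts for t(f|e) and for the buckets."""
    t_count, t_total = defaultdict(float), defaultdict(float)
    d_count = [0.0] * num_buckets

    # E-step: weight of aligning f at i to e at j is t(f|e) * d(bucket(i, j)).
    for fs, es in pairs:
        for i, f in enumerate(fs):
            weights = [t[(f, e)] * d[bucket(i, j, num_buckets)]
                       for j, e in enumerate(es)]
            z = sum(weights)
            for j, e in enumerate(es):
                c = weights[j] / z
                t_count[(f, e)] += c
                t_total[e] += c
                d_count[bucket(i, j, num_buckets)] += c

    # M-step: normalize both sets of fractional counts into distributions.
    for (f, e), c in t_count.items():
        t[(f, e)] = c / t_total[e]
    total = sum(d_count)
    for b in range(num_buckets):
        d[b] = d_count[b] / total
    return t, d
```

Initializing d uniformly (d = [1.0 / N] * N) and seeding t from a trained Model 1 run matches the earlier point that Model 1 estimates are passed on to more elaborate models.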
EM Revisited
- Similar to k-means
- Soft counts vs. hard counts: k-means assigns each point entirely to its closest cluster (a hard count), while EM spreads each point fractionally across clusters in proportion to their posterior probabilities (soft counts)
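To make the distinction concrete, a toy sketch with two 1-D Gaussian clusters (the means and shared variance are arbitrary choices):

```python
import math

def soft_assign(x, means, var=1.0):
    """EM-style soft counts: the posterior responsibility of each cluster
    for point x, assuming equal-weight 1-D Gaussians with shared variance."""
    likes = [math.exp(-(x - m) ** 2 / (2 * var)) for m in means]
    z = sum(likes)
    return [l / z for l in likes]

means = [0.0, 5.0]
resp = soft_assign(2.0, means)
print(resp)                    # soft counts: roughly [0.92, 0.08]
print(resp.index(max(resp)))   # k-means-style hard count: all mass on cluster 0
```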
Tips
- Start early
- Read Knight's tutorial
- Plan your approach before you start
Questions?