Pushpak Bhattacharyya CSE Dept., IIT Bombay

Pushpak Bhattacharyya CSE Dept., IIT Bombay
CS626/449 : Natural Language Processing, Speech and the Web/Topics in AI Lecture 31: POS Tagging (discussion to assist the CMU pronunciation dictionary assignment) Pushpak Bhattacharyya CSE Dept., IIT Bombay

Lexicon Example ^_ Some_ People_ Jump_ High_ ._
Lexicon/ Lexical Example Dictionary Tag Some A (Adjective) {Quantifier} People N (Noun) lot of people V (Verb) peopled the city with soldiers Jump V (Verb) he jumped high N (Noun) This was a good jump High R (Adverb) He jumped high A (Adjective) high mountain N (Noun) Bombay high; on a high

Generative Model ^_^ People_N Jump_V High_R ._. Lexical Probabilities
Bigram Probabilities N A A This model is called Generative model. Here words are observed from tags as states. This is similar to HMM.

Bigram probabilities

Lexical Probability

Calculation from actual data
Corpus ^ Ram got many NLP books. He found them all very interesting. Pos Tagged ^ N V A N N . ^ N V N A R A .

Recording numbers ^ N V A R . 2 1

Probabilities ^ N V A R . 1 1/5 2/5 1/2 1/3

Compare with the Pronunciation Dictionary Assignment
Phoneme Example Translation AE at AE T AH hut HH AH T AO ought AO T AW cow K AW AY hide HH AY D B be B IY In POS tagging the Labels are already given on the words. The “alignment” of Words with labels are already Given. In the assignment the most Likely alignment is to be Discovered followed by the Best possible mapping.

Pushpak Bhattacharyya CSE Dept., IIT Bombay

Similar presentations

Presentation on theme: "Pushpak Bhattacharyya CSE Dept., IIT Bombay"— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Pushpak Bhattacharyya CSE Dept., IIT Bombay

Similar presentations

Presentation on theme: "Pushpak Bhattacharyya CSE Dept., IIT Bombay"— Presentation transcript:

Similar presentations

About project

Feedback