Prototype-Driven Grammar Induction Aria Haghighi and Dan Klein Computer Science Division University of California Berkeley.

Prototype-Driven Grammar Induction Aria Haghighi and Dan Klein Computer Science Division University of California Berkeley

Grammar Induction DT NN VBD DT NN IN NN The screen was a sea of red

First Attempt DT NN VBD DT NN IN NN The screen was a sea of red

Central Questions How do we specify what we want to learn? How do we fix observed errors? What’s an NP? That’s not quite it!

Experimental Set-up Binary Grammar { X 1, X 2, … X n } plus POS tags Data WSJ-10 [7k sentences] Evaluate on Labeled F 1 Grammar Upper Bound: 86.1 XjXj X i XkXk

Experiment Roadmap Unconstrained Induction Need bracket constraint! Gold Bracket Induction Prototypes and Similarity CCM Bracket Induction PhrasePrototype NPDT NN PPIN DT NN

Unconstrained PCFG Induction Learn PCFG with EM Inside-Outside Algorithm Lari & Young [93] Results 0 i j n  (Inside)  (Outside)

Gold Brackets Periera & Schables [93] Result Constrained PCFG Induction

Encoding Knowledge What’s an NP? Semi-Supervised Learning

Encoding Knowledge What’s an NP? For instance, DT NN JJ NNS NNP Prototype Learning

Add Prototypes Manually constructed Grammar Induction Experiments

How to use prototypes? ? ? ? ? DT The NN koala VBD sat IN in DT the NN tree ¦¦ ? NP PP VP S

How to use prototypes? NP PP VP DT The NN koala VBD sat IN in DT the NN tree ¦ ¦ S JJ hungry ?NP

Distributional Similarity Context Distribution  (DT JJ NN) = { ¦ __ VBD : 0.3, VBD __ ¦ : 0.2, IN __ VBD: 0.1, ….. } Similarity  (DT NN)  (DT JJ NN)  (JJ NNS)  (NNP NNP) NP

Distributional Similarity Prototype Approximation  (NP) ¼ Uniform (  (DT NN),  (JJ NNS),  (NNP NNP) ) Prototype Similarity Feature span(DT JJ NN) emits proto=NP span(MD NNS) emits proto=NONE

Prototype CFG + Model NP PP VP DT The NN koala VBD sat IN in DT the NN tree ¦ ¦ S JJ hungry NP P (DT NP | NP) P (proto=NP | NP)

Prototype CFG+ Induction Experimental Set-Up BLIPP corpus Gold Brackets Results

Summary So Far Bracket constraint and prototypes give good performance!

Constituent-Context Model

Product Model Different Aspects of Syntax CCM : Yield and Context properties CFG : Hierarchical properties Intersected EM [Klein 2005] Encourages mass on trees compatible with CCM and CFG

Grammar Induction Experiments Intersected CFG and CCM No prototypes Results

Grammar Induction Experiments Intersected CFG + and CCM Add Prototypes Results

Reacting to Errors Possessive NPs Our Tree Correct Tree

Reacting to Errors Add Prototype: NP-POS NN POS New Analysis

Error Analysis Modal VPs Our TreeCorrect Tree

Reacting to Errors Add Prototype: VP-INF VB NN New Analysis

Fixing Errors Supplement Prototypes NP-POS and VP-INF Results

Results Summary

Conclusion Prototype-Driven Learning Flexible Weakly Supervised Framework Merged distributional clustering techniques with supervised structured models

Thank You! http://www.cs.berkeley.edu/~aria42

Unconstrained PCFG Induction Binary Grammar { X 1, X 2, … X n } Learn PCFG with EM Inside-Outside Algorithm Lari & Young [93] 0 i j n  (Inside)  (Outside) V XjXj X i XkXk N XkXk XjXj V N

Prototype-Driven Grammar Induction Aria Haghighi and Dan Klein Computer Science Division University of California Berkeley.

Similar presentations

Presentation on theme: "Prototype-Driven Grammar Induction Aria Haghighi and Dan Klein Computer Science Division University of California Berkeley."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Prototype-Driven Grammar Induction Aria Haghighi and Dan Klein Computer Science Division University of California Berkeley.

Similar presentations

Presentation on theme: "Prototype-Driven Grammar Induction Aria Haghighi and Dan Klein Computer Science Division University of California Berkeley."— Presentation transcript:

Similar presentations

About project

Feedback