Multi-Task Transfer Learning for Weakly- Supervised Relation Extraction Jing Jiang Singapore Management University ACL-IJCNLP 2009
Aug 5, 2009ACL-IJCNLP Relation Extraction Task definition: to label the semantic relation between a pair of entities in a sentence (fragment) …[leader arg-1 ] of a minority [government arg-2 ]… PHYSPER-SOCEMP-ORGNIL PHYS: Physical PER-SOC: Personal / Social EMP-ORG: Employment / Membership / Subsidiary
Aug 5, 2009ACL-IJCNLP Supervised Learning Current solution: supervised machine learning (e.g. [Zhou et al. 2005], [Bunescu & Mooney 2005], [Zhang et al. 2006]) Training data is needed for each relation type …[leader arg-1 ] of a minority [government arg-2 ]… arg-1 word: leaderarg-2 type: ORG dependency: arg-1 of arg-2 EMP-ORGPHYSPER-SOCNIL
Aug 5, 2009ACL-IJCNLP Challenge in Practice New relation type (in a new domain): no training data or a few seed instances In this work, we study weakly-supervised relation extraction –A few seed instances of the target relation type –Many instances of other auxiliary relation types –Additional human knowledge about the target relation type Main idea: Auxiliary relation types can help!
Aug 5, 2009ACL-IJCNLP Syntactic Similarity across Relation Types …[leader arg-1 ] of a minority [government arg-2 ]… arg-1 word: leaderarg-2 type: ORG dependency: arg-1 of arg-2 the youngest [son arg-1 ] of ex-director [Suharto arg-2 ] the [Socialist People’s Party arg-1 ] of [Montenegro arg-2 ] EMP-ORG PER-SOC GPE-AFF
Aug 5, 2009ACL-IJCNLP Syntactic Similarity Syntactic Pattern Relation InstanceRelation Type (Subtype) arg-2 arg-1Arab leadersOTHER-AFF (Ethnic) his fatherPER-SOC (Family) South Jakarta Prosecution Office GPE-AFF (Based-in) arg-1 [verb] arg-2Yemen [sent] planes to Baghdad ART (User-or- Owner) His wife [had] three young children PER-SOC (Family) Jody Scheckter [paced] Farrari to both victories EMP-ORG (Employ- Staff)
Aug 5, 2009ACL-IJCNLP Problem Formulation based on Transfer Learning Domain adaptation and transfer learning (e.g. [Blitzer et al. 2006], [Hal Daume III 2007]) our goal: PER-SOCEMP-ORG We apply our previous framework ([Jiang & Zhai 2007b]) –Similar in spirit to [Evgeniou & Pontil 2004] and [Daume III, 2007]
Aug 5, 2009ACL-IJCNLP Review of Relation Extraction Basics Linear classifier …[leader arg-1 ] of a minority [government arg-2 ]… arg-2 type: ORG arg-2 type: PER dependency: arg-1 of arg arg-2 type: ORG feature vectorweight vector in linear classifier dependency: arg-1 of arg-2 EMP-ORG
Aug 5, 2009ACL-IJCNLP General vs. Specific Features Assumption: some features are commonly useful for different relation types, while other features are specific for individual relation types : weight vector for target type : weight vector for k’th auxiliary type common weight vector in a lower H dimensional space
Aug 5, 2009ACL-IJCNLP Learning Framework loss function on the target seed instances loss function on the auxiliary training instances
Aug 5, 2009ACL-IJCNLP General Features Which subset of features should be captured by ? common weight vector in a lower H dimensional space
Aug 5, 2009ACL-IJCNLP Feature Separation Automatic separation within the learning framework (see [Jiang & Zhai 2007b]) Human guidance –Argument word features: features that contain head word of an argument E.g. arg-1 word: sister –Entity type features: features that contain the entity type (subtype) of an argument E.g. arg-2 type: ORG Combined
Aug 5, 2009ACL-IJCNLP Imposing Entity Type Constraint Fix the possible entity types for the arguments for the target relation type Filter out the relation instances that do not satisfy the constraint in the end
Aug 5, 2009ACL-IJCNLP Experiment Setup ACE 2004, 7 relation types –6 types auxiliary types 1 type target type 5-fold cross validation # seed instances: 10
Aug 5, 2009ACL-IJCNLP Methods Compared BL: train on seed instances only BL-A: train on seed and auxiliary training instances together w/o feature separation TL-auto: transfer learning w/ automatic feature separation TL-guide: transfer learning w/ human-guided feature separation TL-comb: automatic feature separation combined with human guidance TL-NE: TL-comb + entity type constraint
Aug 5, 2009ACL-IJCNLP Comparison Target TypeBLBL-ATL-autoTL- guide TL- comb TL-NE PhysicalP R F Personal/SocialP R F Employment /Membership /Subsidiary P R F AverageP R F
Aug 5, 2009ACL-IJCNLP Effect of λ λμTλμT P R F Performance of TL-comb. λ μ k = 10 4, λ ν = 1.
Aug 5, 2009ACL-IJCNLP Number of Seed Instances
Aug 5, 2009ACL-IJCNLP Sensitivity of H
Aug 5, 2009ACL-IJCNLP Conclusions We proposed to apply a multi-task transfer learning framework to the weakly-supervised relation extraction problem. We defined two kinds of type-specific features. Our experiments show that automatic feature separation combined with human guidance and entity type constraint can significantly outperform the baselines.
Aug 5, 2009ACL-IJCNLP Thank You! Questions?
Aug 5, 2009ACL-IJCNLP Related Work [Zhou et al. 2008]: Different way of modeling commonality among relation types. [Banko & Etzioni, 2008]: Open-domain relation extraction. No target relation type. [Xu et al. 2008]: Rule-based adaptation. Same type.
Aug 5, 2009ACL-IJCNLP Hypothesized Type-Specific Features