Download presentation
Presentation is loading. Please wait.
Published byKatelyn Volk Modified over 9 years ago
1
A Robust Approach to Aligning Heterogeneous Lexical Resources Mohammad Taher Pilehvar Roberto Navigli MultiJEDI ERC 259234
2
Lexical Resource WordNet BabelNet UBY
3
Lexical Resource WordNet BabelNet UBY
4
Lexical Resource WordNet BabelNet UBY
5
Improved word and concept coverage e.g., named entities, new senses Improved domain coverage Improved multilinguality dozens of new languages Expert-made relations preserved e.g., Hypernymy, meronymy, etc. Why combine resources?
6
Provides complementary knowledge Applications: Semantic Parsing Shi and Mihalcea, 2005 Semantic Role Labeling Palmer et al., 2010 WSD and entity linking Moro et al., TACL 2014 Why combine resources?
7
A synset from BabelNet
8
Why combine resources? A synset from BabelNet for plant (living organism) a case example BabelNet
9
Why combine resources? A synset from BabelNet for plant (living organism)
10
Why combine resources?
11
A synset from BabelNet for plant (living organism)
12
Why combine resources? A synset from BabelNet for plant (living organism)
13
Difficulty of resource alignment Fine granularity of lexical resources plant WordNet 4 senses15 senses
14
How resource alignment works? WordNet plant#n#1 Usually measures the similarity of two concepts WKT: WN:
15
How resource alignment works? Usually measures the similarity of two concepts And aligns two concepts if their similarity exceeds a certain threshold ? WordNet plant#n#1 WKT: WN:
16
How resource alignment works? Alignment approaches differ in the way they calculate this similarity ? WordNet plant#n#1 WKT: WN:
17
Gloss similarity WordNet Denfinitional similarity
18
Strong baseline Fall short when Different wordings are used for same concepts When two words lack quality glosses key -- Metal device shaped in such a way that when it is inserted into the appropriate lock the lock's mechanism can be rotated Key -- An object designed to open and close a lock Denfinitional similarity
19
Strong baseline Fall short when Different wordings are used for same concepts When two words lack quality glosses plant -- Buildings for carrying on industrial labor. plant -- The necessary infrastructure used in support and maintenance of a given facility. Denfinitional similarity
20
Contributions A novel concept similarity measure Denfinitional similarity A robust technique for alignment of resources A robust technique for alignment of heterogeneous resources An effective ontologization approach
21
Our approach: SemAlign
22
Definition similarity
23
Our approach: SemAlign Structural similarity
24
SemAlign: structural similarity WordNet
25
SemAlign: core Modeling concepts through Semantic Signatures
26
SemAlign: core Semantic Signatures of concepts
27
some Personalized PageRank Semantic Signature of a concept
28
Distributional representation over all concepts in the semantic network Personalized PageRank Semantic Signature of a concept
29
Distributional representation over all concepts in the semantic network...
30
Semantic Signature Example WordNet concept for ----- Importance of concept_3 for our concept...
31
Semantic Signature Importance of concept_3 for our concept... cooking#1 frying_pan#1 oil#4 kitchen#3 Example WordNet concept for -----
32
Semantic Signature Importance of concept_3 for our concept... table#3 carpet#2 sugar#2 natural_gas#2 Example WordNet concept for -----
33
Semantic Signature Importance of concept_3 for our concept... read#3 bank#2 Example WordNet concept for -----
34
SemAlign Definition similarity WordNet
35
SemAlign WordNet
36
SemAlign: signature unification
37
WordNet SemAlign: signature unification Find concepts associated with monosemous words
38
WordNet SemAlign: signature unification Truncate vectors to the overlapping concepts
39
SemAlign: signature unification WordNet Synsets containing at least one monosemous word ~ 60% (72,000) The reliability of leveraging monosemous words
40
Semantic Signature Comparison
41
Weighted Overlap (Pilehvar et al., ACL 2013) Semantic Signature Comparison
42
Comparing Semantic Signatures Weighted Overlap
46
SemAlign: score combination
47
Ontologization of lexical resources
48
WordNet Ontologization of lexical resources
49
WordNet Ontologization of lexical resources
52
WordNet Ontologization of lexical resources
53
WordNet Ontologization of lexical resources
55
Definition page for sail
56
WordNet Ontologization of lexical resources
57
WordNet Ontologization of lexical resources
59
WordNet Ontologization of lexical resources
60
Definition page for windmill 1.A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails. Ontologization of lexical resources
61
Definition page for windmill 1.A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails. windmill.n.1 linear motion.n.1 rotational motion.n.1 translate.v.3 wind.n.1 machine.n.1 sail.n.3 adjustable vanes.v.4 Ontologization of lexical resources
62
Ontologization : similarity-based disambiguation Definition page for windmill 1.A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails.
63
Ontologization : similarity-based disambiguation 1.A trip in a boat, especially a sailboat. 2.A sailing vessel; a vessel of any kind; a craft. 3.The blade of a windmill. 4.A tower-like structure found on the dorsal (topside) surface of submarines. 1.A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails. Definition page for sail Definition page for windmill
64
Ontologization : similarity-based disambiguation 1.A trip in a boat, especially a sailboat. 2.A sailing vessel; a vessel of any kind; a craft. 3.The blade of a windmill. 4.A tower-like structure found on the dorsal (topside) surface of submarines. 1.A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails. Definition page for sail Definition page for windmill
65
Ontologization : similarity-based disambiguation 1.A trip in a boat, especially a sailboat. 2.A sailing vessel; a vessel of any kind; a craft. 3.The blade of a windmill. 4.A tower-like structure found on the dorsal (topside) surface of submarines. 1.A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails. Definition page for sail Definition page for windmill
66
Ontologization : similarity-based disambiguation 1.A trip in a boat, especially a sailboat. 2.A sailing vessel; a vessel of any kind; a craft. 3.The blade of a windmill. 4.A tower-like structure found on the dorsal (topside) surface of submarines. 1.A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails. Definition page for sail Definition page for windmill
67
Ontologization: evaluation (Meyer & Gurevych, 2012)
68
Alignment Experiments: Datasets (Matuschek & Gurevych, TACL 2013) WordNet WN synsets manually mapped to their corresponding concepts 320 484 315
69
Alignment Experiments: System configurations Parameter 1: score combination parameter
70
Alignment Experiments: System configurations Parameter 2: similarity threshold
71
Alignment Experiments: System configurations 01 0.5 Setting and a. Unsupervised system
72
Alignment Experiments: System configurations 01 a. Unsupervised system Setting and b. Tuning on subset 0.42 0.53
73
Alignment Experiments: System configurations a. Unsupervised system Setting and b. Tuning on subset c. Cross validation 01 0.44 0.51
74
Comparison system Dijkstra-WSA (Matuschek & Gurevych, TACL 2013)
75
Alignment Experiments SB SB+DWSA F1
76
Alignment Experiments F1 SB SB+DWSA
77
SemAlign: structural similarity
78
Alignment Experiments F1
79
Conclusions A novel approach for aligning lexical resources –Accurate even in the absence of training data –Robust across different resources An effective ontologization approach Experiments on aligning –WN to WP, WT, and OW
80
Future directions Integrating the approach into BabelNet for boosting the alignment accuracy Alignment across different languages Updating lexicons with novel terms
81
1. thanks -- an acknowledgment of appreciation 2. thanks -- with the help of or owing to; 1.thanks -- an expression of gratitude. 2. thanks -- because of; normally used with a positive connotation, though it can be used sarcastically. WordNet Thanks!
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.