Download presentation
Presentation is loading. Please wait.
Published byNathaniel Dixon Modified over 9 years ago
1
1 Word senses: a computational response Adam Kilgarriff
2
Kivik 2013 Kilgarriff: Word senses: a computational response2
3
Kivik 2013 Kilgarriff: Word senses: a computational response3 My PhD (in 5 slides) What is a word sense
4
Kivik 2013 Kilgarriff: Word senses: a computational response4 The lexicographers They create them Methods Introspection Other dictionaries Corpus Atkins, Hanks, Krishnamurthy
5
Kivik 2013 Kilgarriff: Word senses: a computational response5 What is a word sense (1) SFIP Sufficiently frequent insufficiently predictable (a glass of) whisky x (a glass of) tequila
6
Kivik 2013 Kilgarriff: Word senses: a computational response6 What is a word sense (2) homonymy analogy polysemy rules collocation
7
Kivik 2013 Kilgarriff: Word senses: a computational response7 What is a word sense (3) A cluster Of instances of use Operationalised as: corpus lines Clustered by lexicographers
8
Kivik 2013 Kilgarriff: Word senses: a computational response8 What is a word sense (3)
9
Kivik 2013 Kilgarriff: Word senses: a computational response9 What is a word sense (3)
10
Kivik 2013 Kilgarriff: Word senses: a computational response10 What is a word sense (3)
11
Kivik 2013 Kilgarriff: Word senses: a computational response11 What is a word sense (3)
12
Kivik 2013 Kilgarriff: Word senses: a computational response12 What is a word sense (3) A cluster Of instances of use Operationalised as: corpus lines Clustered by lexicographers Makes sense of Overlapping senses Different dictionaries, different senses Lumping and splitting
13
Kivik 2013 Kilgarriff: Word senses: a computational response13 I don’t believe in word senses Believe in: resurrection ghost witch vampire god miracle fairy Philosophy: Ontological commitment (same meaning different register) “good entities to build belief systems on”
14
Kivik 2013 Kilgarriff: Word senses: a computational response14 A word sense is a cluster of corpus lines But I’m an NLP person Automatic clustering? Inspiration: Hindle 1991, Schütze 1993, Grefenstette 1993, Lin 1999 You can get semantic sense from corpora+stats
15
Kivik 2013 Kilgarriff: Word senses: a computational response15 First attempt Longman 1994 Abject failure No grammar Corpus too small and noisy Naïve clustering Useless programmer
16
Kivik 2013 Kilgarriff: Word senses: a computational response16 Collocations Easy Most words don’t go with most other words Then build on what we can do well metaphor, analogy, homonymy, rules all much harder
17
Kivik 2013 Kilgarriff: Word senses: a computational response17 Clustering Word sketch Collocates organised by grammar Dictionary Collocates (and other things) organised by meaning How to re-organise
18
Kivik 2013 Kilgarriff: Word senses: a computational response18 Observation: corpus: arbitrary sample dictionary ( =lexicon) : systematic account Children encounter arbitrary samples develop systematic account
19
Kivik 2013 Kilgarriff: Word senses: a computational response19 Corpus provisional, dispensable used to develop lexicon
20
Kivik 2013 Kilgarriff: Word senses: a computational response20 Levels of abstraction Direct linkage: Fragile Updates (to C or D) break links Dictionary: abstract Corpus: raw Intermediate level needed CorpusDictionary === ===
21
Kivik 2013 Kilgarriff: Word senses: a computational response21 How most automatic word sense disambiguation (WSD) works Analyse dictionary to give set of collocates Match to collocates in a corpus Dispensable corpus CorpusDictionary === === === === Collocates
22
Kivik 2013 Kilgarriff: Word senses: a computational response22 Not just collocates triples parse the corpus some “unary relations” I hear him singing domain-based clues Collocates, Constructions, Domains = CoCoDo
23
Kivik 2013 Kilgarriff: Word senses: a computational response23 Automatically extract CoCoDos from corpus How linked to senses? Automatic (WSD techniques) Manual “dictionary-free”: ideal for new dictionaries Labour costs Mixed WSD with manual confirmation/correction CorpusDictionary === === === === CoCoDo CoCoDo Linking CoCoDo’s to senses
24
Kivik 2013 Kilgarriff: Word senses: a computational response24 Semi-automatic dictionary drafting (SADD) CoCoDo database Automatic clustering Lexicographer input More clustering Dictionary with corpus inside
25
Kivik 2013 Kilgarriff: Word senses: a computational response25 Semi-automatic dictionary drafting (SADD) CoCoDo database Automatic clustering Lexicographer input More clustering Dictionary with corpus inside hard
26
Related projects Dante (completed 2010) Tickbox lexicography Demo Automatic collocations dictionaries SkE Language Resources And Tools Kivik 2013 Kilgarriff: Word senses: a computational response26
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.