Download presentation
Presentation is loading. Please wait.
1
Dimension of Meaning Author: Hinrich Schutze Presenter: Marian Olteanu
2
Introduction Represent context as vectors Dimensions of space – words Initial vectors – determined by word occurrence This paper – reduce dimensionality by singular value decomposition Applications WSD Thesaurus induction
3
Introduction Classic scheme in IR Documents are represented as vectors of words in term space Extension – represent contexts as vectors of words within a fixed window Disadvantage – content can be expressed with different words, close in meaning This approach Represent words as term vectors that reflect their pattern of usage in a large corpus
4
Introduction Dimension in this space: Cash Sport Measure Cosine of the angle between vectors
5
Introduction Compute a representation of context more robust than bag-of-words Centroid (normalized average) of the vectors of the words in a context Practical applications Thousands of dimensions (words) Matrix of concurrence with only 10% zeros
6
Application WSD Done by clustering the contexts AutoClass Buckshot Assign a sense for each cluster
7
Word space
8
Window size, dimension sets
9
Discussion Resembles LSI Uses SVD Purpose of space reduction LSI – improve the quality of representation (because of null values) This paper Reducing the computation Detection of term dependencies (similar terms) SVD doesn’t influence accuracy of WSD
10
Discussion Small number of parameters (thousands) compared to other statistical approaches (i.e.: trigrams)
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.