HyperLex: lexical cartography for information retrieval Jean Veronis Presented by: Siddhanth Jain(113050015) Samiulla Shaikh(113050032)

Slides:



Advertisements
Similar presentations
February 20, Spatio-Temporal Bandwidth Reuse: A Centralized Scheduling Mechanism for Wireless Mesh Networks Mahbub Alam Prof. Choong Seon Hong.
Advertisements

Chapter 5: Tree Constructions
Bart Jansen 1.  Problem definition  Instance: Connected graph G, positive integer k  Question: Is there a spanning tree for G with at least k leaves?
Every edge is in a red ellipse (the bags). The bags are connected in a tree. The bags an original vertex is part of are connected.
Lecture 5 Graph Theory. Graphs Graphs are the most useful model with computer science such as logical design, formal languages, communication network,
Self Stabilizing Algorithms for Topology Management Presentation: Deniz Çokuslu.
Data Structure and Algorithms (BCS 1223) GRAPH. Introduction of Graph A graph G consists of two things: 1.A set V of elements called nodes(or points or.
Chapter 15 Graph Theory © 2008 Pearson Addison-Wesley. All rights reserved.
A Randomized Linear-Time Algorithm to Find Minimum Spanning Trees David R. Karger David R. Karger Philip N. Klein Philip N. Klein Robert E. Tarjan.
Complexity 11-1 Complexity Andrei Bulatov NP-Completeness.
CS Word Sense Disambiguation. 2 Overview A problem for semantic attachment approaches: what happens when a given lexeme has multiple ‘meanings’?
Computability and Complexity 15-1 Computability and Complexity Andrei Bulatov NP-Completeness.
3 -1 Chapter 3 The Greedy Method 3 -2 The greedy method Suppose that a problem can be solved by a sequence of decisions. The greedy method has that each.
Approximation Algorithms Lecture for CS 302. What is a NP problem? Given an instance of the problem, V, and a ‘certificate’, C, we can verify V is in.
Semantic text features from small world graphs Jure Leskovec, IJS + CMU John Shawe-Taylor, Southampton.
1 Vertex Cover Problem Given a graph G=(V, E), find V' ⊆ V such that for each edge (u, v) ∈ E at least one of u and v belongs to V’ and |V’| is minimized.
1 Noun Homograph Disambiguation Using Local Context in Large Text Corpora Marti A. Hearst Presented by: Heng Ji Mar. 29, 2004.
[1][1][1][1] Lecture 2-3: Coping with NP-Hardness of Optimization Problems in Practice May 26 + June 1, Introduction to Algorithmic Wireless.
Job Scheduling Lecture 19: March 19. Job Scheduling: Unrelated Multiple Machines There are n jobs, each job has: a processing time p(i,j) (the time to.
CS 1114: Graphs and Blobs Prof. Graeme Bailey (notes modified from Noah Snavely, Spring 2009)
Online Data Gathering for Maximizing Network Lifetime in Sensor Networks IEEE transactions on Mobile Computing Weifa Liang, YuZhen Liu.
The Euler-tour technique
Semantic Video Classification Based on Subtitles and Domain Terminologies Polyxeni Katsiouli, Vassileios Tsetsos, Stathes Hadjiefthymiades P ervasive C.
Approximation Algorithms Motivation and Definitions TSP Vertex Cover Scheduling.
Minimum Spanning Trees. Subgraph A graph G is a subgraph of graph H if –The vertices of G are a subset of the vertices of H, and –The edges of G are a.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
(ACM KDD 09’) Prem Melville, Wojciech Gryc, Richard D. Lawrence
Evaluating the Contribution of EuroWordNet and Word Sense Disambiguation to Cross-Language Information Retrieval Paul Clough 1 and Mark Stevenson 2 Department.
© The McGraw-Hill Companies, Inc., Chapter 3 The Greedy Method.
Graph Theory Topics to be covered:
Advanced Algorithm Design and Analysis (Lecture 13) SW5 fall 2004 Simonas Šaltenis E1-215b
COSC 2007 Data Structures II Chapter 14 Graphs III.
Graph-based Analysis of Espresso-style Minimally-supervised Bootstrapping Algorithms Jan 15, 2010 Mamoru Komachi Nara Institute of Science and Technology.
Module 5 – Networks and Decision Mathematics Chapter 23 – Undirected Graphs.
Introduction to Graphs. Introduction Graphs are a generalization of trees –Nodes or verticies –Edges or arcs Two kinds of graphs –Directed –Undirected.
Data Structures Week 9 Introduction to Graphs Consider the following problem. A river with an island and bridges. The problem is to see if there is a way.
Paper Review by Utsav Sinha August, 2015 Part of assignment in CS 671: Natural Language Processing, IIT Kanpur.
WORD SENSE DISAMBIGUATION STUDY ON WORD NET ONTOLOGY Akilan Velmurugan Computer Networks – CS 790G.
W ORD S ENSE D ISAMBIGUATION By Mahmood Soltani Tehran University 2009/12/24 1.
An Effective Word Sense Disambiguation Model Using Automatic Sense Tagging Based on Dictionary Information Yong-Gu Lee
Handover and Tracking in a Camera Network Presented by Dima Gershovich.
InterConnection Network Topologies to Minimize graph diameter: Low Diameter Regular graphs and Physical Wire Length Constrained networks Nilesh Choudhury.
Intelligent Database Systems Lab Presenter : Kung, Chien-Hao Authors : Yoong Keok Lee and Hwee Tou Ng 2002,EMNLP An Empirical Evaluation of Knowledge Sources.
CSE 589 Part VI. Reading Skiena, Sections 5.5 and 6.8 CLR, chapter 37.
Iterative Translation Disambiguation for Cross Language Information Retrieval Christof Monz and Bonnie J. Dorr Institute for Advanced Computer Studies.
Presenter : Kuang-Jui Hsu Date : 2011/3/24(Thur.).
Unit – V Graph theory. Representation of Graphs Graph G (V, E,  ) V Set of vertices ESet of edges  Function that assigns vertices {v, w} to each edge.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Unsupervised word sense disambiguation for Korean through the acyclic weighted digraph using corpus and.
Graphs. Introduction Graphs are a collection of vertices and edges Graphs are a collection of vertices and edges The solid circles are the vertices A,
© 2006 Pearson Addison-Wesley. All rights reserved 14 A-1 Chapter 14 Graphs.
Basic Machine Learning: Clustering CS 315 – Web Search and Data Mining 1.
Finding document topics for improving topic segmentation Source: ACL2007 Authors: Olivier Ferret (18 route du Panorama, BP6) Reporter:Yong-Xiang Chen.
Graphs Definition: a graph is an abstract representation of a set of objects where some pairs of the objects are connected by links. The interconnected.
Spanning Tree Definition:A tree T is a spanning tree of a graph G if T is a subgraph of G that contains all of the vertices of G. A graph may have more.
Introduction Wireless Ad-Hoc Network  Set of transceivers communicating by radio.
Sentiment Analysis Using Common- Sense and Context Information Basant Agarwal 1,2, Namita Mittal 2, Pooja Bansal 2, and Sonal Garg 2 1 Department of Computer.
An Adaptive Learning with an Application to Chinese Homophone Disambiguation from Yue-shi Lee International Journal of Computer Processing of Oriental.
Graph-based WSD の続き DMLA /7/10 小町守.
Graph Search Applications, Minimum Spanning Tree
Minimum Spanning Trees
Redraw these graphs so that none of the line intersect except at the vertices B C D E F G H.
Discrete Math 2 Weighted Graph Search Tree
Minimum Spanning Trees
Connected Components Minimum Spanning Tree
Lectures on Graph Algorithms: searching, testing and sorting
Introduction Wireless Ad-Hoc Network
Weighted Graphs & Shortest Paths
Minimum Spanning Trees (MSTs)
Unsupervised Word Sense Disambiguation Using Lesk algorithm
Chapter 14 Graphs © 2011 Pearson Addison-Wesley. All rights reserved.
Presentation transcript:

HyperLex: lexical cartography for information retrieval Jean Veronis Presented by: Siddhanth Jain( ) Samiulla Shaikh( )

Motivation Human language is ambigous. Many words can have multiple meanings depending on context, domain, region etc. Such instances of words are known as polysemous. It is very easy to disambiguate these words for humans but for machines its a difficult job. Task of dismabiguating polysemous words is known as Word Sense Disambiguatio(WSD). WSD is one of the most fundamental problems in the field of NLP.

Motivation Contd... Supervised approaches to WSD give high accuracy. They need large amount of training data. Many languages and domains lack such kind of data. Hence semi-supervised and unsupervised approaches are also emerging as prominent options for WSD.

Various approaches to WSD

HyperLex Hyperlex is one of the most famous unsupervised approach for Word Sense Disambiguation. HyperLex is capable of auto-matically determining the uses of a word in a textbase without recourse to a dictionary. Despite of being unsupervised it has been found to be comparable to state-of-the-art supervised approaches.

Terminology Co-occurence Graphs For each target word we take all the words that co- occure with it and treat them as nodes. We create an edged between two nodes, A and B if their corresponding words co-occur with each other.

Terminology Contd.. Small World Graph A small-world graph is a type of mathematical graph in which most nodes are not neighbors of one another, but most nodes can be reached from every other by a small number of hops or steps. Milgram (1967), was the first who proposed theterm ‘‘small world’’: any individual on the planet is only ‘‘six degrees away’’ from any other individual in the graph of social relations, even though there are several billion inhabitants.

Assigning weights to edges The weight that we assign to each edge reflects the magnitude of the 'semantic distance' between words: When w=0, the words always co-occured When w=1, the words never co-occured

Assigning Weights Each edge is assigned a weight that decreases as the association frequency of the words increases: Where is the conditional probability of observing A in a given context, knowing that context contains B and vice versa.

Example

Co-Occurence Graph

Finding Connected Components Detecting the different uses of a word thus amounts to isolating the high-density components in its cooccurrence graph. Unfortunately, most exact graph-partitioning techniques are NP-hard. The author has given an approximate algorithm which gives fairly good results.

Root Hub In every high-density component, one of the nodes has a higher degree than the others; it is called the component’s root hub. For example, for the most frequent use of bank, the root hub is the word money. It is easy to find, since it is the hub with the highest degree in the graph (and it is also the most frequent word).

Detecting Root Hubs First find the highest degree node and call it the first root hub. Now delete the selected root hub along with all its neighbours. Repeat this process untill either all nodes have been covered or there is no elligible vertex for a root hub. A vertex is considered to be eligible for being a root hub if the 6 most frequent neighbours have weights less than 0.8(found experimentally)

Co-Occurence Graph

First Root Hub Detected

Neighbours Identified

Second Root Hub Identified from remaining graph

Neighbours Identified

Delineating Components Once all the root hubs have been found connect all of them with the target word with 0 edge weight in co-occurence graph. Now find the MST(Minimum Spanning Tree) for this graph.

MST Target word is assigned a level 0. All root hubs are assigned a level 1. All nodes at level 1 represents different senses of the target word and each one of them represents a component.

Assigning Scores Each node in the tree is assigned a score vector of the size of the number of components. If v belongs to component i Otherwise Now for each node in the tree we have a score vector.For example S(water)=(0,x) x>=0

Disambiguation Now we add the score vectors of all vertices present in the context. The target word is assigned the sense corresponding to the winning root hub.

Testing Results

Conclusion Hyperlex in one of the most successful unsupervised approach for WSD. It doesn't need any external lexical resource for disambiguation. Its accuracy with small number of words is comparable to state-of-the-art supervised WSD approaches.

References VE ́RONIS, J Hyperlex: Lexical cartography for information retrieval. Comput. Speech Lang. 18, 3,223–252 Navigli, R Word Sense Disambigua-tion: a survey. ACM Computing Surveys, 41(2):1–69