X Ambiguity & Variability The Challenge The Wikifier Solution

Slides:



Advertisements
Similar presentations
Linking Entities in #Microposts ROMIL BANSAL, SANDEEP PANEM, PRIYA RADHAKRISHNAN, MANISH GUPTA, VASUDEVA VARMA INTERNATIONAL INSTITUTE OF INFORMATION TECHNOLOGY,
Advertisements

October 2014 Paul Kantor’s Fusion Fest Workshop Making Sense of Unstructured Data Dan Roth Department of Computer Science University of Illinois at Urbana-Champaign.
Linking Named Entity in Tweets with Knowledge Base via User Interest Modeling Date : 2014/01/22 Author : Wei Shen, Jianyong Wang, Ping Luo, Min Wang Source.
Textual Relations Task Definition Annotate input text with disambiguated Wikipedia titles: Motivation Current state-of-the-art Wikifiers, using purely.
Encyclopaedic Annotation of Text.  Entity level difficulty  All the entities in a document may not be in reader’s knowledge space  Lexical difficulty.
Global and Local Wikification (GLOW) in TAC KBP Entity Linking Shared Task 2011 Lev Ratinov, Dan Roth This research is supported by the Defense Advanced.
Contacts. Stored in the Contact folder Stores information about businesses or people –Address, phone number, address etc.
Made by: Dan Ye. Introduction Basic Last Page ☆ HTML stands for Hyper Text Markup Language; ☆ HTML is not a programming language, it is a markup language;
Digital textNamed Entities Hovering over a named entity highlights the areas where it appears in the text.
J. Chen, O. R. Zaiane and R. Goebel An Unsupervised Approach to Cluster Web Search Results based on Word Sense Communities.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Named Entity Disambiguation Based on Explicit Semantics Martin Jačala and Jozef Tvarožek Špindlerův Mlýn, Czech Republic January 23, 2012 Slovak University.
Projects ( ) Ida Mele. Rules Students have to work in teams (max 2 people). The project has to be delivered by the deadline that will be published.
Introduction to Internet Engineering Tutorial 7 All about Assignment 2 By Tse Hok
Extracting Key Terms From Noisy and Multi-theme Documents Maria Grineva, Maxim Grinev and Dmitry Lizorkin Institute for System Programming of RAS.
C OLLECTIVE ANNOTATION OF WIKIPEDIA ENTITIES IN WEB TEXT - Presented by Avinash S Bharadwaj ( )
Processing of large document collections Part 10 (Information extraction: multilingual IE, IE from web, IE from semi-structured data) Helena Ahonen-Myka.
1 Wikification CSE 6339 (Section 002) Abhijit Tendulkar.
University of Sheffield, NLP Entity Linking Kalina Bontcheva © The University of Sheffield, This work is licensed under the Creative Commons.
Annotating Words using WordNet Semantic Glosses Julian Szymański Department of Computer Systems Architecture, Faculty of Electronics, Telecommunications.
How Much Do You Know? Web Literacy Web Quest BY: John Cork.
Péter Schönhofen – Ad Hoc Hungarian → English – CLEF Workshop 20 Sep 2007 Performing Cross-Language Retrieval with Wikipedia Participation report for Ad.
NoteSearch - Find what you’re looking for. Prototype Team B.
Page 1 March 2011 Local and Global Algorithms for Disambiguation to Wikipedia Lev Ratinov 1, Dan Roth 1, Doug Downey 2, Mike Anderson 3 1 University of.
Exploiting Wikipedia Categorization for Predicting Age and Gender of Blog Authors K Santosh Aditya Joshi Manish Gupta Vasudeva Varma
Mobile Search Engine Based on idea presented in paper Data mining for personal navigation, Hariharan, G., Fränti, P., Mehta S. (2002)
HTML Basic. What is HTML HTML is a language for describing web pages. HTML stands for Hyper Text Markup Language HTML is not a programming language, it.
Page 1 INARC Report Dan Roth, UIUC March 2011 Local and Global Algorithms for Disambiguation to Wikipedia Lev Ratinov & Dan Roth Department of Computer.
2015/12/121 Extracting Key Terms From Noisy and Multi-theme Documents Maria Grineva, Maxim Grinev and Dmitry Lizorkin Proceeding of the 18th International.
Named Entity Disambiguation on an Ontology Enriched by Wikipedia Hien Thanh Nguyen 1, Tru Hoang Cao 2 1 Ton Duc Thang University, Vietnam 2 Ho Chi Minh.
Date: 2013/10/23 Author: Salvatore Oriando, Francesco Pizzolon, Gabriele Tolomei Source: WWW’13 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang SEED:A Framework.
LINDEN : Linking Named Entities with Knowledge Base via Semantic Knowledge Date : 2013/03/25 Resource : WWW 2012 Advisor : Dr. Jia-Ling Koh Speaker : Wei.
Toward Entity Retrieval over Structured and Text Data Mayssam Sayyadian, Azadeh Shakery, AnHai Doan, ChengXiang Zhai Department of Computer Science University.
PART 1 XML Basics. Slide 2 Why XML Here? You need to understand the basics of XML to do much with Android All of they layout and configuration files are.
Welcome to MT140 Introduction to Management Unit 1 Seminar – Introduction to Management.
PowerPoint Adding Hyperlinks and Hiding Slides Learn to Link to websites and other slides in the presentation! Adding Hyperlinks and Hiding Slides Learn.
Food and Agriculture Organization of the UN GILW Library and Documentation Systems Division Food, Nutrition and Agriculture Ontology Portal.
Cross-Lingual Named Entity Recognition via Wikification
Illinois CCG LoReHLT16 Situation Frame System
Best pTree organization? level-1 gives te, tf (term level)
Showcasing work by Jonnageddala, Liaw, Ray, Kumar, Chang, and Dai on
of customers say buying decisions are influenced by
Concept Grounding to Multiple Knowledge Bases via Indirect Supervision
Graphing Relationships
GLOW- Global and Local Algorithms for Disambiguation to Wikipedia
Lecture 24: NER & Entity Linking
Text Analytics in ITS 2.0: Annotation of Named Entities
Programming Language Syntax 2
Presentation 王睿.
Relational Inference for Wikification
OUR GIFT TO YOU... As our gift to you, select a subscription
إستراتيجيات ونماذج التقويم
A Level English Language
Searching EIT, Author Gay Robertson, 2017.
Discovering Emerging Entities with Ambiguous Names
Advanced search techniques in databases
FUNCTION NOTATION AND EVALUATING FUNCTIONS
A Framework for Benchmarking Entity-Annotation Systems
Introduction to Information Retrieval
5.2 Relations and Functions
NUR2300 – Guide to Searching ClinicalKey for Nursing
Text Annotation: DBpedia Spotlight
Social Practice of the language: Describe and share information
Relations/Sequences Objective: Students will learn how to identify if a relation is a function. They will also be able to create a variable expression.
UNDERSTANDING FUNCTIONS
Entity Linking Survey
Title Goes Here Title Goes Here Title Goes Here Title Goes Here
Language in the Media Lesson 2.
Text Analytics in ITS 2.0: Annotation of Named Entities
Recapitulation of Lecture 13
Presentation transcript:

X Ambiguity & Variability The Challenge The Wikifier Solution Try the demo at http://cogcomp.org/page/demo_view/xl_wikifier takes input text (web pages, Word docs, plain text) identifies and disambiguates important entities and concepts returns text enhanced with links to Encyclopedic resources (Wikipedia) describing contextually unambiguous concepts. Keywords: Entity Resolution & Linking; Contextual Disambiguation; Grounding to Reference Records Developed by: Chen-Tse Tsai, Chase Duncan, Liang-Wei Chen, Shyam Upadhyay, Nitish Gupta, Mark Sammons Ambiguity & Variability There are many ways to say the same thing. Wikifier uses context to recognize what is meant and ground it in the most specific Wikipedia page. (It would also recognize “Kansas City” here, just as it links “San Diego” to the Chargers’ page). Which “Alex Smith” are we talking about? The Challenge Much of the world’s information is expressed as natural language text – in documents such as emails, reports, articles, blogs, social media, … An entity or concept mentioned in text is often ambiguous, and could refer to multiple different entities. Also, different surface representations can refer to the same entity or concept. Wikifier addresses these problems by (i) contextually disambiguating entities and concepts and (ii) grounding entities and concepts in encyclopedic resources that provide additional knowledge on them. This opens up a range of opportunities for better understanding of the text and accessing it at the level of concepts, not keywords. The Wikifier Solution Γ* is a solution to the problem, a set of mention-title pairs (m,t). Φ(m,t) scores how well mention m maps to title t based on local context Ψ(ti,tj) scores how likely two titles ti, tj are to appear in a document A global assignment of all mentions to titles is evaluated based on local context, relational structure of the text and global coherence. Wikifier links to the Chiefs’ QB, not the Bengals’ TE. Who is Abu Mazen? Who is Mahmoud Abbas? Wikifier can help (in many languages)… Variability X Cross-Lingual Wikification