Summarizing Entities: A Survey Report

Slides:



Advertisements
Similar presentations
Location Recognition Given: A query image A database of images with known locations Two types of approaches: Direct matching: directly match image features.
Advertisements

Query Chain Focused Summarization Tal Baumel, Rafi Cohen, Michael Elhadad Jan 2014.
Search in Source Code Based on Identifying Popular Fragments Eduard Kuric and Mária Bieliková Faculty of Informatics and Information.
Improved TF-IDF Ranker
DISCOVER: Keyword Search in Relational Databases Vagelis Hristidis University of California, San Diego Yannis Papakonstantinou University of California,
Comparing Twitter Summarization Algorithms for Multiple Post Summaries David Inouye and Jugal K. Kalita SocialCom May 10 Hyewon Lim.
Encyclopaedic Annotation of Text.  Entity level difficulty  All the entities in a document may not be in reader’s knowledge space  Lexical difficulty.
Aki Hecht Seminar in Databases (236826) January 2009
Semantic text features from small world graphs Jure Leskovec, IJS + CMU John Shawe-Taylor, Southampton.
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
Web Projections Learning from Contextual Subgraphs of the Web Jure Leskovec, CMU Susan Dumais, MSR Eric Horvitz, MSR.
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
Query Biased Snippet Generation in XML Search Yi Chen Yu Huang, Ziyang Liu, Yi Chen Arizona State University.
Extracting Interest Tags from Twitter User Biographies Ying Ding, Jing Jiang School of Information Systems Singapore Management University AIRS 2014, Kuching,
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
Information Retrieval in Practice
Query session guided multi- document summarization THESIS PRESENTATION BY TAL BAUMEL ADVISOR: PROF. MICHAEL ELHADAD.
Projects ( ) Ida Mele. Rules Students have to work in teams (max 2 people). The project has to be delivered by the deadline that will be published.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Extracting Key Terms From Noisy and Multi-theme Documents Maria Grineva, Maxim Grinev and Dmitry Lizorkin Institute for System Programming of RAS.
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
Estimating Importance Features for Fact Mining (With a Case Study in Biography Mining) Sisay Fissaha Adafre School of Computing Dublin City University.
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
Extracting Semantic Constraint from Description Text for Semantic Web Service Discovery Dengping Wei, Ting Wang, Ji Wang, and Yaodong Chen Reporter: Ting.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Querying Structured Text in an XML Database By Xuemei Luo.
11 A Hybrid Phish Detection Approach by Identity Discovery and Keywords Retrieval Reporter: 林佳宜 /10/17.
LexRank: Graph-based Centrality as Salience in Text Summarization
©2003 Paula Matuszek CSC 9010: Text Mining Applications Document Summarization Dr. Paula Matuszek (610)
Discovery from Linking Open Data (LOD) Annotated Datasets Louiqa Raschid University of Maryland PAnG/PSL/ANAPSID/Manjal.
Keyword Searching and Browsing in Databases using BANKS Seoyoung Ahn Mar 3, 2005 The University of Texas at Arlington.
Chapter 6: Information Retrieval and Web Search
Mining Topic-Specific Concepts and Definitions on the Web Bing Liu, etc KDD03 CS591CXZ CS591CXZ Web mining: Lexical relationship mining.
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
Contextual Ranking of Keywords Using Click Data Utku Irmak, Vadim von Brzeski, Reiner Kraft Yahoo! Inc ICDE 09’ Datamining session Summarized.
BioSnowball: Automated Population of Wikis (KDD ‘10) Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/11/30 1.
DOCUMENT UPDATE SUMMARIZATION USING INCREMENTAL HIERARCHICAL CLUSTERING CIKM’10 (DINGDING WANG, TAO LI) Advisor: Koh, Jia-Ling Presenter: Nonhlanhla Shongwe.
Department of Software and Computing Systems Research Group of Language Processing and Information Systems The DLSIUAES Team’s Participation in the TAC.
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
Semi-automatic Product Attribute Extraction from Store Website
Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.
Answer Mining by Combining Extraction Techniques with Abductive Reasoning Sanda Harabagiu, Dan Moldovan, Christine Clark, Mitchell Bowden, Jown Williams.
Event-Based Extractive Summarization E. Filatova and V. Hatzivassiloglou Department of Computer Science Columbia University (ACL 2004)
Natural Language Processing Group Computer Sc. & Engg. Department JADAVPUR UNIVERSITY KOLKATA – , INDIA. Professor Sivaji Bandyopadhyay
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
A Survey on Automatic Text Summarization Dipanjan Das André F. T. Martins Tolga Çekiç
Research Progress Kieu Que Anh School of Knowledge, JAIST.
An Integrated Approach for Relation Extraction from Wikipedia Texts Yulan Yan Yutaka Matsuo Mitsuru Ishizuka The University of Tokyo WWW 2009.
Reading literacy. Definition of reading literacy: “Reading literacy is understanding, using and reflecting on written texts, in order to achieve one’s.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
Queensland University of Technology
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Structured Browsing for Unstructured Text
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Multimedia Information Retrieval
Social Knowledge Mining
Thanks to Bill Arms, Marti Hearst
Gong Cheng, Yanan Zhang, and Yuzhong Qu
#VisualHashtags Visual Summarization of Social Media Events using Mid-Level Visual Elements Sonal Goel (IIIT-Delhi), Sarthak Ahuja (IBM Research, India),
Navi 下一步工作的设想 郑 亮 6.6.
Automatic Detection of Causal Relations for Question Answering
Enriching Taxonomies With Functional Domain Knowledge
An Approach to Abstractive Multi-Entity Summarization
Learning to Rank Typed Graph Walks: Local and Global Approaches
Rachit Saluja 03/20/2019 Relation Extraction with Matrix Factorization and Universal Schemas Sebastian Riedel, Limin Yao, Andrew.
Entity Linking Survey
Information Retrieval
Presentation transcript:

Summarizing Entities: A Survey Report Shruti Chhabra ; and Srikanta Bedathur 徐丹云

Definition Resources Summarizing entities task Various documents Knowledge bases Summarizing entities task Extract information associated with entities from these sources Produce summaries

Categories Output Summary Type Number of Input Entities Single Entity Entity Pair Multiple Entities Textual Paragraph (biographical, definitional, domain-specific), Wiki-style Textual relations, Evidence trails Evidence trails Structural Property-value pair ranking, Structure extraction Relation extraction, Structure extraction

Textual-Single Entity Entity description “... Reuters reported that the radical Muslim group Hamas has claimed responsibility for the bombing act...“ noun phrases (NPs) Islamic Resistance Movement Hamas Paragraph summaries Wiki-style summaries Wikipedia

Textual-Single Entity-paragraph Generic Summarizaiton Biographical Summarization “Who is X” Use a top-down approach to search for patterns specific for biographies Utilize bottom up approach to further filter sentences which are biographical in nature Definitional queries “Who is X” and “What is X” Domain-specific Financial(company-specific entities) and geo-locational Distinctive characteristics

Textual-Single Entity-paragraph Generic Summarizaiton Topic signature[57], Conroy et al.[27] Graph based LexRank[34] Topic-sensitive version of LexRank, Otterbacher et al.[62]

Textual-Single Entity-paragraph net centroid value of the sentence position of the sentence in the document similarity with the first sentence LexRank QueryPhraseMatch sentence length tf.idf score similarity with the headline of the document Weighted frequency of entity alias or mention as pronoun Frequency of cue words associated with facets of a person frequency of words usually found with query entity

Textual-Single Entity-paragraph

Textual-Single Entity-Wiki Style distinctly presenting the various aspects of entity written in well structured manner Techniques learning templates Select content to fill these templates(Sauper and Barzilay [67], Autopedia system[92]) Selecting facts similar to infoboxes(Liu et al. [58])

Textual-Entity Pair find direct and indirect relations between entity pair Textual Relations(direct) Patty, Open IE the entities with corresponding relational pattern providing contextual evidence for the relationship between entities. Evidence Trails(indirect) sequence of sentences

Textual-Multiple Entity a connected graph for any number of inputs(Srihari et al. [70]) a concept chain graph (CCG) was built by processing a corpus graph matching technique ------ finding minimum steiner tree in CCG.

Structural-Single Entity Property-Value pair Ranking(1-hop) Structure Extraction System technique Frequency informativeness diversity popularity Cheng et al.[23] Random walk  Thalhammer et al. [81] Sydow et al. [78] Greedy algrithom Sydow et al. [75]

Structural-Entity Pair Relation Extraction facts relating given entities rule based, feature based and kernel based methods Structure Extraction paths or subgraphs

Structural-Entity Pair-Structure Extraction System Main idea Technique Graph Theory Faloutsos et al. [36] a connected subgraph, edge-weighted undirected graph, model the graph as a circuit maximizing the goodness function greedy heuristic Relational Databases (keyword search) Agrawal et al.[1] Symbol table: analogous to inverted lists in IR BANKS[12] Current single source shortest path algorithms from each source; Find root nodes connecting all the source nodes Knowledge Discovery Halaschek et al. [44] Paths between two entities rank list of relationship paths Anyanwu and Sheth [8] a path from each of these entities which meet at a common node a m-length path from entity such that the parent class of each node in one path is same as the parent class of corresponding node in other path

Structural-Multiple Entities mine an important subgraph for query nodes Tong and Faloutsos [82] retrieved nodes in a graph closely connected to query node and thus mined an important subgraph for query nodes. Kasneci et al. [52] Obtain steiner tree from the subgraph using STAR [53] algorithm

Comparison Textual Summaries Structural Summaries source Textual - documents Structural – knowledge bases information Rich in context poor cleanliness noisy Noise-free processable difficultly easily understandable Leverage the advantages of both text and structure Use knowledge bases to obtain connections among entities forming structural summaries augment with text to form textual summaries

Thanks