An Approach to Abstractive Multi-Entity Summarization

Slides:



Advertisements
Similar presentations
SDMX in the Vietnam Ministry of Planning and Investment - A Data Model to Manage Metadata and Data ETV2 Component 5 – Facilitating better decision-making.
Advertisements

Query Chain Focused Summarization Tal Baumel, Rafi Cohen, Michael Elhadad Jan 2014.
CSCI 465 D ata Communications and Networks Lecture 20 Martin van Bommel CSCI 465 Data Communications & Networks 1.
Junction Trees And Belief Propagation. Junction Trees: Motivation What if we want to compute all marginals, not just one? Doing variable elimination for.
Taxonomies of Knowledge: Building a Corporate Taxonomy Wendi Pohs, Iris Associates
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
Clustering Unsupervised learning Generating “classes”
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Navigating and Browsing 3D Models in 3DLIB Hesham Anan, Kurt Maly, Mohammad Zubair Computer Science Dept. Old Dominion University, Norfolk, VA, (anan,
LAN Overview (part 2) CSE 3213 Fall April 2017.
Graph Coalition Structure Generation Maria Polukarov University of Southampton Joint work with Tom Voice and Nick Jennings HUJI, 25 th September 2011.
Grant Number: IIS Institution of PI: Arizona State University PIs: Zoé Lacroix Title: Collaborative Research: Semantic Map of Biological Data.
Need for Semantics  Now models represent just the object appearance  We need to represent also its  Properties  Roles  Behaviour  Services  … 
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
-1- Philipp Heim, Thomas Ertl, Jürgen Ziegler Facet Graphs: Complex Semantic Querying Made Easy Philipp Heim 1, Thomas Ertl 1 and Jürgen Ziegler 2 1 Visualization.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
Web Document Clustering: A Feasibility Demonstration Oren Zamir and Oren Etzioni, SIGIR, 1998.
EASE: An Effective 3-in-1 Keyword Search Method for Unstructured, Semi-structured and Structured Data Cuoliang Li, Beng Chin Ooi, Jianhua Feng, Jianyong.
An Internet of Things: People, Processes, and Products in the Spotfire Cloud Library Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Christopher Moh 2005 Competition Programming Analyzing and Solving problems.
Event-Centric Summary Generation Lucy Vanderwende, Michele Banko and Arul Menezes One Microsoft Way, WA, USA DUC 2004.
Algorithmic Detection of Semantic Similarity WWW 2005.
CS 8751 ML & KDDData Clustering1 Clustering Unsupervised learning Generating “classes” Distance/similarity measures Agglomerative methods Divisive methods.
Automatic Labeling of Multinomial Topic Models
Ch. Eick: Some Ideas for Task4 Project2 Ideas on Creating Summaries that Characterize Clustering Results Focus: Primary Focus Cluster Summarization (what.
Graph Indexing From managing and mining graph data.
Presented by: Siddhant Kulkarni Spring Authors: Publication:  ICDE 2015 Type:  Research Paper 2.
Ontology Engineering and Feature Construction for Predicting Friendship Links in the Live Journal Social Network Author:Vikas Bahirwani 、 Doina Caragea.
Ontology Evaluation Outline Motivation Evaluation Criteria Evaluation Measures Evaluation Approaches.
Bump hunting In The Dark: Local Discrepancy Maximization on Graphs
Yiting Xia, T. S. Eugene Ng Rice University
Data Mining – Intro.
Hanan Ayad Supervisor Prof. Mohamed Kamel
Multiple approaches to archival description
Saisai Gong, Wei Hu, Yuzhong Qu
Chapter 6: Design of Expert Systems
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Video Summarization by Spatial-Temporal Graph Optimization
Summarizing Entities: A Survey Report
Large Graph Mining: Power Tools and a Practitioner’s guide
Ontology Partition for Browsing
Hierarchical clustering approaches for high-throughput data
Gong Cheng, Yanan Zhang, and Yuzhong Qu
Data Warehousing and Data Mining
CS 430: Information Discovery
Linked Open Data Project
Navi 下一步工作的设想 郑 亮 6.6.
Adaptive entity resolution with human computation
Type-directed Topic Segmentation of Entity Descriptions
Disambiguation Algorithm for People Search on the Web
An Interactive Approach to Collectively Resolving URI Coreference
RDF graph summaries 金成 2014/11/3.
Qingxia Liu Interactive Hierarchical Tag Clouds for Summarizing Spatiotemporal Social Contents [ICDE 2014] Kang, Wei, Anthony KH Tung,
Subgraphs, Connected Components, Spanning Trees
CMNS 110: Term paper research
Towards Exploratory Relationship Search: A Clustering-Based Approach
Graphs and Algorithms (2MMD30)
Information Networks: State of the Art
Navigation-Aided Retrieval
Leverage Consensus Partition for Domain-Specific Entity Coreference
Gong Cheng,Danyun Xu,Yuzhong Qu
Text Categorization Berlin Chen 2003 Reference:
Facilitating Navigation on Linked Data through Top-K Link Patterns
Claim 1: Concepts and Procedures
Introduction Dataset search
Connecting the Dots Between News Article
Thesis Presentations MSCS 491.
CoXML: A Cooperative XML Query Answering System
Presentation transcript:

An Approach to Abstractive Multi-Entity Summarization 龚赛赛 2015-06-23

Contents Background and Motivation Related Work Method Framework

Background Multi-entity summarization: distill the major information from a set of entities to generate a compressed summary Useful for many tasks, including entity search and entity browsing Potential for QA (cp. multi-document summarization for QA)

Motivation Existing works mainly generate extractive summaries for an entity set, usually an entity pair, to find certain similarities, differences or identity information Extract a subset of sentences/features/property-value pairs for different entities Not efficient for identifying similarities and differences for a set of more than 2 entities

Motivation Example Better summary using count All the 23 persons affiliation Websoft 3 status teachers, 5 status phdstudents, … 15 of 23 from jiangsu 3 deliver www paper … …… Affiliation:websoft …… Affiliation:websoft Affiliation:websoft Affiliation:websoft status: Professor status : Professor status: AP status: phd student status: phd student …… …… From: jiangsu From: jiangsu From: jiangsu From: jiangsu From: jiangsu Deliver www paper Deliver www paper Deliver www paper

Motivation Generate abstractive summary for finding commonality (similarity) of the entire set and particularity(difference) of each entity Using count Using subclass,subproperty reasoning Using literal value partitioning

Summary UI Commonality panel Difference panel (focus on difference) … All the 23 persons affiliation Websoft 3 status teachers, 5 status phdstudents, … 15 of 23 from jiangsu … Deliver 2 www papers Leader

Related work Single entity summarization ([1,3]) Find important information Based on centrality, relatedness, informativeness, uniqueness and so on Consider diversity using linear programming, MMR or clustering Entity pair summarization ([2]) Identify similarities and differences based on uniqueness and property-value similarity/dissimilarity Consider diversity Entity set ([4]) Identify uniqueness, differences and contextual relatedness for facilitating entity linking Structured summary Path for entity pairs [6], Steiner tree for input entities [5], connected subgraph for input entities [7] Not condensable

Method Framework Two summaries: Commonality summary for identifying the major common information of the set (entire vs subset), and single summary for identifying the particularity of a single entity Single summary: extract top-K property-value pairs Informal commonality summary statement ? Given the entity set with their descriptions, generate top-K “facets”? Facets allowing entity num count, literal value interval ? A facet consists of a property with its value combination? 20 students. 5 are of phd students. 9 are of msc students. 5 persons age 23-26 . 3 persons age 28+ .

Commonality Summary Value combination e.g. For types, get superclasses. Select suitable classes to group/partition entities For literals, find suitable intervals to group/partition entities Handle equivalent/synonymous props Replace to a representative one Basic idea for generating top-k facets Covered entity number The group number of entities inducing from the facet Property importance

Single Summary Remove property-value pairs that are similar to the facets in commonality summary Extract property-value pairs as summary based on informativeness and uniqueness.

Evaluation Plan Entity set: an entity with its neighbors Invite expert to make guided summary Compare automatic summary with reference

Evaluation Plan

Reference FACES: Diversity-Aware Entity Summarization using Incremental Hierarchical Conceptual Clustering. aaai Facilitating human intervention in coreference resolution with comparative entity summaries. eswc Relin: relatedness and informativeness-based centrality for entity summarization. iswc Summarizing Entity Descriptions for Effective and Efficient Human-centered Entity Linking. www STAR: Steiner Tree Approximation in Relationship-Graphs Rex:Explaining relationships between entity pairs Fast discovery of connection subgraphs